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ABSTRACT 


Investigations  of  the  use  of  optical  for  distributing  the  clock  to  a  CMOS  chip  are  reported. 
Two  chips  were  designed,  the  first  incorporating  a  set  of  integrated  detectors  followed  by 
transimpedance  amplifiers.  The  clock  was  distributed  as  an  optical  square  wave.  The 
detected  clock  signals  were  then  amplified,  distributed  on  polysilicon,  and  applied  to  digital 
logic.  The  chips  were  fabricated  at  the  MOSIS  facility.  Testing  of  this  family  of  chips 
yielded  a  maximum  clock  rate  of  about  20  MHz.  A  second  chip  was  designed  in  which  the 
optical  clock  was  used  only  to  synchronize  a  series  of  free-running  electronic  clocks 
distributed  about  the  chip.  Again  the  fabrication  was  performed  at  MOSIS.  While  theory 
and  simulations  predicted  a  maximum  clock  frequency  of  100  MHz,  it  proved  impossible  to 
actually  lock  the  clocks,  due  to  nonuniformities  in  the  fabrication  parameters  across  the 
chip.  Future  work  should  aim  at  modifying  the  design  for  greater  tolerance  to  fabrication 
nonuniformities. 
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I.  Introduction  and  Overview 

This  document  represents  the  final  report  on  Army  Research  Office  Contract  No. 
DAAG29-85-K-0211,  summarizing  the  work  accomplished  from  August  1,  1985  through 
June  30,  1989.  The  report  is  organized  into  five  sections,  the  first  being  this  introduction 
and  overview.  Section  11  is  a  report  on  the  most  recent  work  on  this  contract,  covering  the 
period  September  1988  through  the  contract  end.  The  work  during  this  period  was  devoted 
to  testing  the  most  recent  chips,  obtained  from  the  DARPA  MOSIS  facility.  This  section  of 
the  report  was  written  by  Karin  Sperley  and  A.  Tabibian,  both  Stanford  students.  Section 
in  summarizes  certain  administrative  aspects  of  the  grant 

Appendix  A  is  a  reproduction,  in  entirety,  of  the  thesis  of  Bradley  Clymer,  who  lead  the 
work  on  the  design  of  the  first  optically  clocked  chip  built  under  this  program.  Bradley 
Clymer  received  his  doctorate  in  Electrical  Engineering,  and  is  now  an  Assistant  Professor 
at  the  University  of  Ohio.  Appendix  B  is  a  reproduction,  again  in  entirety,  of  the  thesis  of 
Richard  Welch,  who  designed  the  second  chip  produced  under  this  contract.  Mr.  Welch 
received  the  degree  Engineer  and  is  currently  a  student  in  the  MBA  program  at  the 
University  of  Illinois. 

Thus  to  read  the  work  in  chronology,  start  with  Appendix  A,  then  read  Appendix  B,  then 
finish  with  Section  O. 
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IL  Testing  of  The  Optical  Clock  Distribution  II  Chip 

A.  Introduction 

This  section  describes  the  testing  of  Mr.  Welch’s  Optical  Clock  Distribution  n  (OCD2) 
chip.  The  testing  was  broken  into  two  parts:  testing  without  the  optical  clock  signal 
present,  and  testing  with  this  signal  present 

B.  Testing  Without  The  Optical  Clock  Signal 

The  pre-testing  of  the  chips  consisted  of  checking  the  signals  at  the  output  pins  of  the 
powered  chip  to  determine  if  the  electrical  circuitry  was  functioning  properly.  These 
outputs  include  those  from  the  skew  detector,  frequency  divider,  and  shift  register 
circuitry.  The  pre-testing  is  described  in  section  3.2  of  appendix  B. 

The  results  of  the  pre-testing,  which  was  performed  by  Mr.  Welch,  indicated  that  nearly  all 
of  the  skew  pins  and  error  pins  from  the  shift  registers  on  the  chips  had  the  correct  signals 
present.  However,  most  of  the  outputs  from  the  frequency  divider  circuitry  were  incorrect. 
The  problem  was  twofold.  No  signal  was  present  in  many  cases,  and  when  the  signal  was 
present  the  frequency  was  nearly  half  that  predicted  by  the  design.  These  problems  are 
described  in  more  detail  below. 

Each  of  the  nine  optical  receiver/clock  amplifier  (ORCA)  units  on  a  chip  generate  a  clock 
signal  with  a  voltage  controlled  oscillator.  From  this  signal  two  clock  phases  and  their 
complements  are  generated.  The  frequency  of  the  four  phases  of  the  selected  ORCA  are 
divided  by  64  by  the  frequency  divider  circuitry  and  can  be  viewed  on  the  four  /©ut  pins. 
The  problem  was  that  the  third  and  fourth  phases  of  the  clock  were  not  present  on  the  /out2 
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/out3  of  any  chip,  while  the  other  two  clock  phases  were  sometimes  present  on 
/outo  /out! .  The  diagram  below  shows  the  physical  location  of  the  ORCAs  on  a  chip. 
Each  location  is  divided  into  four  squares  corresponding  to  the  four  clock  phases  generated 
from  that  ORCA's  oscillator.  The  number  in  the  box  is  the  percentage  of  chips  on  which 
that  particular  phase  of  that  particular  ORCA  was  seen  to  be  operational  on  an  /out  pin. 


Diagram  of  Operational  Clock  Phases  and  ORCAs 


Figure  1. 


Furthermore,  the  frequencies  seen  on  the  /out  pins  correspond  to  oscillator  frequencies  of 
50-65  MHZ,  roughly  one-half  to  two-thirds  of  the  100  MHz  value  predicted  by  SPICE 
models. 
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An  interesting  characteristic  of  the  chips'  operation  was  observed  after  the  chips  had 
warmed  up.  The  signal  output  observed  on  an  /out  pin  became  noisy  in  amplitude  and 
phase.  The  signal  became  clear  again  if  the  supply  voltage  was  lowered,  but  would  again 
become  noisy  at  the  lowered  voltage  after  a  time.  Also,  lowering  the  voltage  lowered  the 
oscillator  frequency. 

The  results  of  the  pre-test  indicated  that  Mr.  Welch's  design  did  not  sufficiently  account  for 
fabrication  processing  variations  since  some  of  the  circuits  functioned  and  other  identically 
designed  circuits  did  not.  In  addition  parts  of  the  design  do  not  work  at  all  on  any  chip.  It 
also  seems  likely  that  tlie  SPICE  models  that  were  used  may  not  be  accurate  at  predicting 
the  fabricated  device  performance  since  the  designed  operating  frequency  was  nearly  twice 
that  of  the  actual  device. 

A  simple  test  to  check  the  correct  operation  and  measure  the  transfer  functions  of  the 
voltage  controlled  oscillators  (VCOs)  was  performed  in  addition  to  the  pre-tests.  From  the 
circuit  diagram  in  Figure  2.11  in  appendix  B,  it  can  be  seen  that  in  the  absence  of  light 
input  the  output  of  the  NAND  gate  (LF_Input)  will  always  be  high,  i.e.  equal  to  the  supply 
voltage  VCC.  Referring  to  the  circuit  diagram  of  the  low-pass/lag-lead  filter  in  Figure  2.4, 
it  can  be  seen  that  a  DC  LFJnput  will  result  in  a  CNTR  voltage  equal  to  LFJnput.  CNTR 
is  the  controlling  voltage  of  the  VCO,  and  therefore  the  VCO  frequency  may  be  tuned  by 
adjusting  VCC.  This  was  done  on  ORCA2  of  chip  #4  and  the  resulting  data  appears  in  the 
graph  below. 

Note  that  as  expected,  the  VCO  responds  linearly  to  changes  in  CNTR.  The  VCO  ceased 
to  operate  at  voltages  below  about  2  volts.  Also  note  that  the  VCO  can  indeed  achieve 
operating  frequencies  of  100  MHz  and  above,  but  this  requires  high  VCC  values  at  which 
the  chips  begins  to  overheat. 


6 


C.  Testing  With  The  Optical  Clock  Signal 


The  optical  clock  signal  was  generated  with  a  4mW  laser  diode  at  750  nm  wavelength.  The 
laser  was  internally  biased  in  its  linear  region  and  was  directly  modulated  with  a  signal 

generator.  The  laser  beam  was  delivered  to  the  detector  as  shown  in  figure  3. 

CONTR  Voltage  vs.  VCO  Frequency 


CONTR  Voltage  (=VCC)  (Volts) 

Figure  2. 

Efforts  to  make  the  chip  respond  to  the  optical  clock  signal  were  fruitless.  The 
photodetectors  were  exposed  to  laser  light  modulated  at  a  few  MHz  below  the  natural 
operating  frequency  of  the  ORCA  oscillators  (50-65  Mfiz)  and  llie  frequency  was  then 
swept  up  in  increments  of  lOOKHz.  The  /out  pins  were  observed  for  signs  of  capture  and 
lock  of  the  Phase  Locked  Loop  by  direct  observation  of  the  oscilloscope  trace  and  by 
externally  triggering  the  scope  with  the  modulation  signal  and  watching  for  triggering  of  the 
/out  signal.  However  no  change  in  output  frequency  or  triggering  were  detected.  The 
ORCAs'  oscillators  did  not  lock  onto  the  modulating  frequency  of  the  laser.  Different 
ORCAs  on  the  various  chips  were  tried  with  various  laser  power  settings  ;  however,  no 
PLL  capture  was  detected.  The  only  response  to  the  laser  observed  on  the  /out  was  a 
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small  frequency  shift  corresponding  to  about  a  IMHz  drop  in  VCO  operating  frequency. 
This  phenomenon  is  probably  due  to  changes  in  device  parameters  caused  by  the  heat  of  the 
incident  laser.  Since  no  ORCA  locked  onto  the  optical  clock  signal,  the  rest  of  the  work 
consisted  of  testing  the  optical  system  for  a  possible  source  of  this  failure. 

Video 
Camera 


Figure  3 


There  are  several  possible  causes  for  the  failure  of  the  ORCA's  that  deal  with  the  external 
optical  test  equipment:  The  laser  not  modulating  or  not  modulating  at  a  frequency  in  the 
capture  range  of  the  phase-locked  loop;  the  small,  focused  laser  spot  not  hitting  the 
detector;  not  enough  or  too  much  laser  power  on  the  detector;  and  the  sinusoidal 
modulation  of  the  laser  is  not  sufficient  to  switch  the  digital  circuitry  (square-wave 


modulatior.  was  assumed  in  the  spice  modeling  of  Mr.  Welch).  As  many  as  possible  of 
these  causes  for  failure  were  isolated,  tested,  and  eliminated  as  reasons  for  the  chips’ 
failure.  The  rest  of  this  section  describes  this  procedure. 

The  fu^t  concern  was  that  the  laser  was  not  actually  being  modulated  due  to  faulty  circuitry 
in  the  laser  or  an  impedance  mismatch  between  the  signal  generator  and  the  laser.  It  was 
found  by  utilizing  a  fast  photodetector  that  impedance  matching  was  required  and  that  with 
this  matching  the  laser  was  indeed  modulated.  To  make  sure  that  the  modulation 
frequency  was  within  the  lock  in  range  of  the  ORCA,  the  frequency  was  set  as  close  to  the 
natural  frequency  of  the  ORCA  as  possible.  The  external  triggering  technique  was 
employed  to  detect  the  locking-on  to  the  signal,  but  none  was  observed. 

Alignment  of  the  laser  spot  was  a  difficult  procedure  and  very  prone  to  error.  This  was  due 
to  the  small  size  of  the  detector  (20  microns  square)  and  focused  laser  spot,  and  also 
because  of  the  difficulty  in  viewing  the  750  nm  wavelength  light  focused  onto  a  nearly 
specularly  reflecting  surface.  To  aid  in  alignment  a  microscope  with  a  video  camera  and 
monitor  was  implemented.  This  was  done  by  splitting  off  the  light  reflected  from  the 
chip's  surface  and  focusing  it  with  magnification  onto  a  video  camera.  In  order  to  view 
both  the  laser  spot  and  the  details  of  the  chip’s  surface  at  the  same  tiine,  it  was  necessary  to 
both  attenuate  the  laser  light  and  to  illuminate  the  chip's  surface.  In  order  to  achieve  the 
illumination  without  blocking  the  laser  it  was  necessary  to  disassemble  the  delayed  beam 
path  (described  by  Mr.  Welch  in  chapter  3  of  appendix  B)  and  insert  a  bright  focusing  LED 
next  to  the  beamsplitter  (see  figure  3).  This  path  removal  was  done  to  avoid  laser  power 
loss  due  to  an  additional  beam  splitter  and  because  the  delayed  path  was  not  needed  until 
we  could  get  the  ORCAs  to  lock  on  the  signal. 
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When  viewed  on  the  monitor  the  detectors  could  be  seen  and  the  laser  spot  could  be 
approximately  centered  on  the  detector.  It  was  difficult  to  estimate  the  spot  size  of  the 
laser,  however,  due  to  blooming  of  the  video  camera.  Even  with  careful  alignment  utilizing 
the  monitor,  the  ORCA's  still  would  not  lock  onto  the  laser  signal. 

Next  the  possibility  of  incorrect  laser  power  was  checked.  Since  no  power  meters  were 
available,  all  that  was  done  was  to  vary  the  average  power  of  the  laser  and  to  watch  for  the 
ORCA  to  lock  on.  The  ORCA  was  never  observed  to  lock  on  at  any  laser  power  used. 

The  last  possible  cause  of  failure  tested  was  the  modulation  signal.  Initially  a  sinusoidal 
signal  generator  was  used  to  modulate  the  laser  because  of  availability.  It  became  clear 
from  reading  Mr.  Clymer's  thesis  (appendix  A)  that  a  square  wave  modulated  clock  signal 
was  necessary.  A  pulse  generator  with  sufficient  bandwidth  was  obtained.  However,  the 
laser  itself  had  a  modulation  bandwidth  only  extending  to  5(X)  MHz  so  that  the  laser  limited 
the  rise  and  fall  times  of  the  clock  signal.  The  signal  generator  provided  only  enough 
power  to  modulate  the  laser  about  60%  which  may  or  may  not  be  sufficient  (simulation  on 
SPICE  would  be  necessary).  This  pulse  generator  was  used  with  visual  alignment  of  the 
beam  and  various  frequencies  and  powers  were  tried.  Lock-on  of  the  ORCA  was  never 
observed. 

Since  there  seemed  to  be  no  point  in  continuing  experimentation  with  laser  modulation, 
testing  with  an  unmodulated  laser  was  attempted. 

Sensitivity  of  the  photodetector  was  tested  by  using  different  powers  of  laser  light  to  the 
detector  and  measuring  the  chip's  response  via  fluctuations  in  its  supply  current.  Four 
different  levels  of  incident  light  were  used:  1)  No  incident  light;  2)  Room  light;  3)  Room 
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light  +  ImW  laser  light;  and  4)  Room  light  +  2mW  laser  light.  When  the  amount  of 
incident  light  was  changed  a  small  but  definite  shift  in  the  supply  current  was  noticed. 

However,  moving  the  laser  beam  around  in  the  vicinity  of  the  photodetector  showed  that 
the  maximum  change  in  power  supply  current  occurred  when  the  laser  beam  appeared  to  be 
below  and  to  the  left  of  the  photodetector.  Therefore,  these  fluctuations  in  supply  current 
may  be  due  to  the  heating  effects  of  the  laser  rather  than  changes  in  the  state  of  the  circuitry 
caused  by  current  generated  in  the  photodetector. 

D.  Theoretical  Calculations  And  Simulations 

Based  on  the  expected  component  values  of  the  low-pass/lag-lead  filter  a  capture  range  of 
about  4MHz  can  be  predicted.  The  procedure  used  for  this  calculation  is  taken  from  the 
Signetics  1985  "Linear  Data  Manual-Communications"  pp.  4-236  to  4-256.  Since  Mr. 
Welch  expected  filter  performance  to  be  essentially  constant  under  process  variations,  this 
number  is  expected  to  hold,  at  least  theoretically.  However,  Mr.  Welch's  low-pass/lag- 
lead  filter  is  a  second  order  filter  while  the  Signetics  formulae  assume  a  simple  first-order 
filter.  To  a  first  approximation,  however,  the  device  values  used  in  Welch's  design  are 
theoretically  reasonable. 

As  mentioned  before,  the  crucial  problem  with  the  SPICE  simulations  may  be  that  they  are 
not  accurate  enough  at  the  100  MHz  frequencies  used  in  this  design.  Validity  of  the  diode 
models  at  100  MHz  frequencies  is  especially  under  question.  Also,  note  that 
photo'.  ..>.:tors,  are  most  often  fabricated  as  p-i-n  structures  whereas  Welch's  MOSIS 
proa  c  d  not  allow  the  inclusion  of  an  intrinsic  layer  thereby  reducing  the  transport 
efficiency'  aie  junction. 


E.  Conclusions  And  Suggestions  For  Future  Work  On  Testing 

Testing  of  Mr.  Welch's  chip  did  allow  verification  or  near-verification  of  the  operation  of 
several  of  the  designs  components: 

•  By  the  mechanism  described  in  the  previous  pages,  it  was  possible  to  verify  the  correct 
operation  of  the  Voltage  Controlled  Oscillator. 

•  It  is  safe  to  assume  the  correct  operation  of  the  frequency  divider  circuitry.  Note  that 
Figure  1  illustrates  that  none  of  the  /out2  or  /out3  pios  show  an  output  frequency.  This 
cannot  be  due  to  flaws  in  the  divide-by-64  circuitry  design  because  the  /outo  ^nd  /outi 

use  similar  circuitry  to  divide  the  frequency  of  their  outputs. 

•  The  functionality  of  the  low-pass/lag-lead  filter  for  DC  LF_Input  values  is  certain. 

•  The  design  and  operation  of  the  phase  generation  circuitry  has  been  verified  despite  the 
lack  of  output  on  the  /oui2  or  f  out3-  This  can  be  said  by  considering  that  all  the  /out 

frequencies  are  generated  by  the  symmetric  Phase  Extraction  Circuitry  (Figure  2.15); 
therefore,  the  failure  of  two  frequencies  to  appear  on  the  pins  is  probably  due  to 
problems  other  than  partially  dysfunctional  Phase  Extraction  Circuitry. 

•  As  mentioned  before  the  output  of  the  Skew  Detection  and  Error  Detection  pins  was 
verified  as  correct  under  testing  without  optical  input. 


Despite  these  successes,  perhaps  the  most  instructive  lesson  of  Mr.  Welch's  work  has  been 
to  underscore  the  extra  difficulties  involved  in  testing  a  chip  with  an  optical  interface  to  the 
outside  world. 

A  major  cause  of  concern  is  that  the  current  design  affords  no  simple  and  effective  method 
for  the  experimenter  to  determine  the  exact  amount  of  light  incident  on  the  photodetector  of 
each  ORCA.  The  imaging  techniques  described  in  the  Optical  Testing  section  were  used  to 
direct  the  laser  beam  to  the  photodetector,  however,  the  blooming  due  to  the  local  saturation 
of  the  video  camera's  detector  made  it  impossible  to  determine  if  the  beam  had  indeed  been 
entirely  focused  on  the  20|im  by  20|im  photodetector.  Hence,  it  was  not  possible  to 
ascertain  the  amount  of  power  incident  on  the  exposed  surface  area  of  the  photodetector. 
No  external  pin  connections  were  included  in  the  design  that  could  have  allowed  this 
measurement.  For  future  designs,  it  would  be  extremely  prudent  to  provide  pin 
connections  or  some  other  mechanism  for  determining  the  exact  location  of  the  beam  on  the 
chip.  For  instance,  the  main  photodetector  could  be  surrounded  by  other  detectors  with 
connections  to  outside  the  chip.  By  measuring  the  percentage  of  total  light  incident  on  each 
detector,  it  would  then  be  possible  to  accurately  calculate  the  position  of  the  beam. 

As  far  as  Welch's  chip  is  concerned,  it  would  seem  that  the  design  philosophy  of  the  chip 
is  too  optimistic.  Before  an  entire  operational  chip  can  be  built,  the  design  of  the  individual 
sub-circuits  must  be  understood  and  shown  to  be  correct  and  reasonable.  This  is 
something  that,  depending  on  the  sub-circuit,  could  not  be  done  or  was  done  with  extreme 
difficulty  on  Welch's  chip.  Therefore,  future  designs  must  include  far  greater  testing 
capabilities  designed  onto  the  chip.  Obviously,  these  test  structures  must  be  as  independent 
as  possible  from  the  other  circuitry  on  the  chip.  Separate  power  supply  connections,  as 
well  as  a  separate  clock  is  recommended.  The  current  design  uses  the  four  phases 
generated  from  a  VCO  to  clock  shift  registers;  failure  to  produce  all  four  phases  means  that 
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much  of  the  testing  circuitry,  such  as  the  shift  registers,  will  not  operate.  Many  other 
testing  schemes  are  possible,  but  in  general  a  less  aggressive,  more  modularized  and 
testable  approach  is  required. 

The  lack  of  proper  equipment  also  proved  to  be  a  hindrance  in  the  efficient  testing  of 
Welch's  chip.  In  future  research  done  at  this  or  other  universities,  it  might  prove 
productive  to  arrange  for  testing  to  be  done  at  the  usually  better  equipped  industrial 
laboratories. 
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Abstract 


Timing  constraints  for  state-of-the-art  very  large  scale  integrated  circuits  (VLSI) 
in  silicon  are  rapidly  approaching  communication  limits  available  with  layered  two- 
dimensional  metal  and  polysilicon  wiring  approaches.  For  such  communication- 
limited  systems,  reliable  clock  distribution  is  a  key  concern.  The  range  of  finite 
differences  in  signal  delays  over  clock  wires  of  various  lengths  for  large  chips  creates 
a  timing  skew  that  is  significant  when  compared  to  the  switching  time  of  tran¬ 
sistors  in  the  circuit.  The  high  bandwidth  and  three-dimensionality  of  imaging 
optical  systems  suggest  that  optical  clock  distribution  systems  have  the  potential 
to  overcome  the  timing  barriers  presented  by  planar  wiring.  Clock  signals  can  be 
holographically  mapped  to  detector  sites  within  smaU  functional  cells  on  a  chip 
surface.  Within  each  functional  cell,  the  clock  is  distributed  via  smface  wires 
with  negligible  delays.  Since  the  difference  in  propagation  time  between  optical 
paths  is  negligible  when  compared  to  typical  electronic  response  times,  the  timing 
uncertainty  for  such  a  system  is  composed  of  two  parts:  the  difference  in  response 
times  (skew)  between  identical  receiver  copies  implemented  on  a  given  chip,  and 
the  rms  signal  jitter  at  each  receiver  due  to  circuit  noise.  An  overviev/  of  an  opti¬ 
cal  distribution  system  is  presented,  along  with  system  constraints  and  the  design 
trade-offs  these  constraints  represent.  Two  types  of  optical  receivers  for  CMOS 
implementation  are  presented:  a  transimpedance  receiver  similar  to  those  used 
in  optical  communication  systems,  and  a  phase-locked  loop  receiver  which  offers 
substantial  improvement  in  performance  and  layout  requirements  over  the  first 
approach.  Timing  analysis  for  the  two  types  of  CMOS  optical  receivers  is  pre¬ 
sented,  showing  that  a  2-micron  implementation  of  the  transimpedance  receiver 
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is  limited  to  a  maximum  operating  frequency  of  50  MHz  by  fabrication-related 
response  skews  of  up  to  3  ns,  while  a  2-micron  phcise-locked  loop  receiver  can 
operate  in  the  100  -  200  MHz  range  with  steady-state  response  skews  less  than 
55  ps  and  negligible  signal  jitter.  A  CMOS  test  chip  to  measure  the  performance 
of  a  3-micron  implementation  of  the  transimpedance  receiver  has  been  fabricated. 
Laboratory  measurements  show  that  the  maximum  operating  frequency  of  this 
receiver  is  approximately  15  MHz  when  150  /xW  of  incident  optical  power  is  ap¬ 
plied,  while  response  skew  ranges  from  5  -  20  ns.  Observation  of  storage  times 
for  dynamic  latch  cells  in  proximity  to  photodiodes  on  the  cliip  show  that  optical 
power  and  device  separation  specifications  can  be  reasonably  applied  to  guarantee 
minimum  storage  time. 
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Chapter  1 


Introduction 


The  large  bandwidth,  immunity  to  interference,  and  electrical  isolation  provided 
by  optical  communication  techniques  have  led  to  recent  interest  in  optical  inter¬ 
connections  for  computer  systems.  Optics  has  become  an  appealing  alternative  to 
wired  interconnections  on  several  levels  of  communication  hierarchy  within  com¬ 
puting  systems[l,2,3,4,5].  As  timing  performance  for  on-chip  processing  becomes 
communication  limited,  chip-wide  clock  signal  distribution  is  rapidly  becoming  a 
problem  in  which  an  optical  interconnection  solution  may  be  appropriate[l,4]. 

The  miniaturization  of  integrated  circuit  elements  by  layout  design  scaling  in 
very  large  scale  integrated  circuits  (VLSI)  has  created  a  great  deal  of  interest  in 
the  development  of  new  methods  for  reducing  the  timing  skew  associated  with 
transmitting  signals  via  wires  to  remote  locations  on  a  chip[6,7].  As  device  sizes 
decrease  and  chip  sizes  increase  with  technological  advances,  the  speed  of  instruc¬ 
tion  execution  on  a  VLSI  chip  becomes  limited  by  signal  transmission  delay  rather 
than  device  switching  delays[8,9,10].  Communication  delay  is  especially  critical 
in  the  distribution  of  the  clock  signal  which  is  used  to  synchronize  the  operation 
of  various  devices  on  a  chip.  Parasitic  transmission  line  capacitance  and  resis¬ 
tance  over  varying  pathlengths  for  this  widely  distributed  signal  cause  a  skew  in 
waveform  arrival  times  at  different  locations  on  the  chip. 
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CHAPTER  1.  INTRODUCTION 


For  optical  clock  distribution,  the  clock  signal  is  mapped  at  the  speed  of  light 
from  an  ofF-chip  laser  diode  via  an  optical  element  to  photodetectors  integrated 
on  the  surface  of  the  silicon  chip.  In  general,  the  optical  approach  could  be  three 
dimensional,  using  a  hologram  or  lenslet  array,  or  it  could  use  integrated  or  fiber 
optics.  The  system  presented  here  is  three  dimensional  in  that  the  space  above 
the  integrated  circuit  chip  is  used  to  route  the  optical  signals  rather  than  confining 
the  light  to  fibers  or  waveguides  in  a  planar  or  quasiplanar  topology.  In  an  optical 
system,  clock  skew  can  be  reduced  to  essentially  the  variation  in  response  times 
for  the  different  receivers  distributed  over  the  chip  surface. 


1.1  Clock  Distribution  with  Conventional  Wire 
Paths 

There  is  a  design  tradeoff  inherently  associated  with  wired  distribution  of  signals 
on  integrated  circuit  chips.  This  tradeoff  on  one  hand  involves  the  choice  of 
conductor  for  the  wire  path,  and  on  the  other  the  availability  of  the  medium 
chosen. 

VLSI  designers  can  route  signals  on  three  types  of  wires,  each  having  a  dif¬ 
ferent  resistivity  and  all  having  approximately  the  same  capacitance  per  unit 
area.  Aluminum  wires  have  a  resistivity  that  is  two  orders  of  magnitude  less  than 
polysilicon  or  diffusion  paths,  making  aluminum  a  very  desirable  wiring  medium, 
especially  for  long  paths[ll].  Chipwide  distribution  of  the  clock  signal  is  gener¬ 
ally  required  in  VLSI  designs,  requiring  long  paths  and  suggesting  metal  as  an 
appropriate  medium. 

Aluminum  wires  are  also  required  for  power  and  ground  distribution  because 
the  resistivity  of  other  media  causes  excessive  voltage  level  degradation[12].  In 
addition,  local  connection  between  drain  diffusions  of  p-channel  and  n-channel 
transistors  in  CMOS  logic  designs  is  made  with  metal  wiring  rather  than  polysil¬ 
icon,  to  avoid  forming  junctions  between  the  highly  doped  polysilicon  wire  and 
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one  of  the  drain  diifusions  which  is  doped  with  the  opposite  type  impurity.  Since 
many  VLSI  fabrication  technologies  support  only  one  or  two  levels  of  metal  in¬ 
terconnection,  the  remaining  metal  available  for  global  signal  distribution  can  be 
severely  limited.  Distribution  of  timing  signals  in  alumimum  adds  to  the  overhead 
requirements  on  the  metal  layers,  and  further  reduces  the  space  available  for  data 
paths. 

Two  problems  with  signal  distribution  by  means  of  wires  of  finite  resistance 
and  capacitance  become  especially  acute  when  the  chipwide  timing  system  is 
talcen  as  an  example.  The  clock  signal  is  used  to  synchronize  the  operations  of 
a  very  large  number  of  devices  on  a  VLSI  chip.  The  number  of  devices  that 
a  clock  distribution  system  must  accomodate  and  the  wide  range  of  distances 
between  devices  create  special  manifestations  of  the  general  wired  communication 
problems.  The  finite  capacitance  and  resistivity  of  the  wires  represent  a  very 
large  loading  effect  due  to  the  extensive  fanout  of  the  clock  distribution  system. 
The  wires  to  individual  devices  are  ultimately  interconnected  in  some  parallel 
manner;  therefore,  the  capacitances  of  the  individual  wires  add  to  represent  the 
load  capacitance  at  the  clock  driver.  The  transition  time  for  the  output  voltage  of 
the  clock  driver  is  proportional  to  the  ratio  of  the  load  capacitance  to  the  driver 
capacitance.  In  this  manner,  a  large  capacitive  load  causes  a  broadening  of  signal 
pulses  by  increasing  the  transition  time  for  the  clock  driver  and  slows  the  overzdl 
system  operation. 

One  design  method  for  reducing  the  effect  of  capacitive  loading  is  the  caiscading 
of  buffer  stages  with  successively  larger  gate  areas.  This  minimizes  the  total 
throughput  delay  by  allowing  each  buffer  to  charge  an  optimum  capacitive  load. 
The  optimum  increase  in  gate  area  is  by  a  factor  of  e;  however,  the  scale  by  which 
the  inverter  size  is  increeised  is  usually  four  or  five  in  order  to  make  efficient  use 
of  chip  surface  area[13]. 

Another  clock  distribution  problem  is  associated  with  the  finite  resistance  and 
c.  .pacitance  of  the  signal  paths.  The  circuit  lines  can  be  modeled  as  distributed 
RC  paths,  and  as  such,  the  waveform  propagation  is  described  by  the  diffusion 
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Figure  1:  An  H-tree  clock  distribution  system. 

equation[14].  Each  length  of  wire  has  an  associated  delay  that  is  a  function  of 
the  resistance,  capacitance  and  dimensions  of  the  wire.  Furthermore,  as  device 
sizes  become  smaller  with  layout  design  scaling,  this  characteristic  communication 
delay  increases  quadratically  for  the  same  length  of  wire[9].  The  large  range  of 
characteristic  communication  delays  for  the  wires  connecting  the  clock  driver  to 
the  many  clocked  devices  on  a  chip  leads  to  a  variation  in  the  signal  arrival  time 
from  one  device  to  another.  This  is  commonly  referred  to  as  clock  skew,  and  it  is 
a  primary  consideration  in  determining  the  system  clock  rate. 

There  have  been  several  approaches  suggested  and  implemented  to  reduce  or 
eliminate  the  clock  skew  effect.  One  such  approach,  suggested  by  Anceau,  in¬ 
volves  distributing  a  lower  frequency  chipwide  clock  signal  to  several  function2il 
blocks,  and  internally  synthesizing  a  high  frequency  clock  to  synchronize  opera¬ 
tions  within  each  block[6].  A  second  approach  involves  forcing  all  wire  lengths  to 
be  equal.  One  method  for  realizing  this  approach  is  an  H-tree[7].  An  example 
is  shown  in  Figure  1.  Line  length  variation  effects  are  eliminated  by  the  novel 
geometry  of  the  layout.  Remaining  skew  effects  in  such  an  H-tree  structure  are 
due  to  variations  in  the  threshold  voltages  of  the  load  transistors  and  charac¬ 
teristic  capacitance  and  resistance  between  different  paths[15].  Because  the  wire 
lengths  are  equal  with  this  approach  and  chipwide  distribution  of  the  clock  signal 
is  required,  H-tree  distribution  has  the  disadvantage  of  uniformly  long  wires,  and 
therefore  transitions  are  very  slow  due  to  a  very  large  capacitive  load  for  the  clock 
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driver.  In  addition,  the  H-tree  distribution  requires  a  regular  chip  layout  pattern, 
in  opposition  to  the  very  popular  modular  design  approach  in  which  the  layouts 
for  individual  fimctional  cells  are  developed  separately  and  incorporated  in  block 
form  at  the  chip  design  level.  A  third  clock  distribution  approach  eliminates  a 
chipwide  synchronization  signal  by  designing  several  functional  blocks,  each  being 
self-timed.  This  approach  allows  fast  execution  of  operations  in  the  functional 
block,  but  at  the  expense  of  handshaking  delays  for  communication  and  added 
control  lines  between  functional  blocks(16].  All  of  the  approaches  have  the  unfor¬ 
tunate  characteristic  of  requiring  massive  use  of  metal  wiring  due  to  the  lengths 
of  communication  lines  necessary  for  signal  coverage  of  an  entire  chip. 


1.2  Optical  Interconnection  for  Computing 

Optics  can  provide  several  advantages  for  interconnection  in  computing  systems[4]. 
One  major  advantage  for  optical  distribution  approaches  is  the  freedom  from 
mutual  coupling  between  different  signal  paths.  In  Si-Si02  systems,  there  is  a 
trade-off  between  distributed  capacitance  of  an  interconnection  wire  and  mutual 
capacitive  coupling  of  signals  to  adjacent  wires.  The  capacitance  between  a  wire 
and  the  ground  plane  can  be  reduced  by  increrising  the  dielectric  thickness  of  the 
Si02,  but  in  taking  this  action,  the  isolation  between  pareillel  wires  is  reduced,  and 
increased  mutual  coupling  results.  Optical  distribution  systems  do  not  exhibit  a 
similar  mutual  coupling  effect. 

A  second  advantage  of  optics  for  computer  interconnections  is  the  ability  of 
light  beams  to  pass  through  each  other  without  interacting.  This  is  primarily 
due  to  the  noninteraction  of  photons  in  linear  media.  This  allows  a  more  efficient 
use  of  layout  space;  it  is  valid  for  waveguide  systems  with  angles  of  intersection 
greater  than  10  degrees  and  virtually  all  free-space  optical  systems. 

A  third  major  advantage  applicable  to  free-space  optical  interconnections  is 
the  potential  to  use  rewritable  optical  materials  to  allow  real-time  reconfiguration 
of  the  interconnection  pattern. 
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Research  activities  are  currently  in  progress  to  incorporate  optical  approaches 
at  nearly  every  level  of  computer  interconnection  hierarchy.  Indeed,  optical  inter¬ 
connections  have  already  begim  to  appear  for  intercomputer  local  area  networks[17, 
18].  There  is  now  activity  to  investigate  optical  interconnection  between  proces¬ 
sors  in  a  multiprocessor  system[2].  Several  programs  have  been  applying  optical 
solutions  to  interchip  communication  probiems[19,20,21].  In  addition,  special  op>- 
ticEil  distribution  for  signals  with  critical  timing  and  large  fan-out  might  be  ad¬ 
vantageous  even  though  studies  have  shown  that  intrachip  optical  communication 
is  not  power  efficient[3,22].  Clock  distribution  to  several  ftuictional  areas  on  a 
given  VLSI  chip  represents  this  t3rpe  of  special  signal  distribution. 

The  implementations  for  optical  interconnections  that  have  been  demonstrated 
or  recommended  are  widely  varying.  Fiber  optic  systems  have  been  demonstrated 
for  local  area  networks  between  computers[17,18],  and  for  lower  levels  of  optical 
intercoimection  as  well[19,21,23].  Other  systems  are  based  on  planar  waveguide 
optics  [24]  or  three  dimensional  optics  with  holographic,  grating  or  lenslet  array 
elements  [1,3,4,20,21,25].  While  most  applications  reported  in  the  literature  have 
dealt  with  high  speed  data  signals,  several  authors  have  suggested  the  use  of  optics 
for  clock  distribution.  Hartman  has  proposed  a  waveguide  distribution  system  at 
the  board  level,  while  Goodman,  et  tt/.[l,4],  Fried[21],  and  Bergmein,  et  a/.[20]  have 
suggested  free-space  optical  systems  for  clock  distribution  at  the  chip  level.  Fried 
presents  a  phase-locked  loop  opticed  clock  extraction  circuit  for  CMOS  based  on 
a  voltage  controlled  ring  oscillator. 

The  optical  clock  distribution  system  that  is  presented  here  is  a  three  dimen¬ 
sional  design  in  which  a  clock  signal  is  mapped  from  an  off  chip  light  source  to 
several  photodetector  locations  on  an  integrated  chip  surface.  The  beam  mapping 
is  performed  by  means  of  a  holographic  optical  element  or  a  lenslet  array  with 
negligible  skew  for  the  optical  signal.  The  optical  signal  can  be  used  in  one  of  two 
manners.  In  one  approach,  the  optical  signal  can  simply  be  detected  by  means 
of  a  photodiode  and  amplified  to  a  digital  signal  level,  converted  to  a  standard 
VLSI  two-phase  clock  and  distributed  via  short  polysilicon  wires  to  the  individual 


1.3.  ORGANIZATION 


I 


devices  within  each  functional  cell.  In  an  alternative  approach,  the  optical  signal 
can  be  used  to  synchronize  several  local  oscillators  distributed  on  the  VLSI  chip. 
In  this  approach,  the  output  of  each  local  oscillator  provides  the  input  square  wave 
for  a  two-phase  clock  synthesizer  in  each  functional  cell,  and  the  two  phases  of  the 
clock  are  again  distributed  via  short  polysilicon  wires  to  devices  in  the  functional 
cell.  For  either  approach,  clock  signal  skew  is  characterized  by  the  variation  in 
photoreceiver  response  times  for  identical  receivers  implemented  on  a  given  chip. 


1.3  Organization 

The  next  chapter  is  a  detailed  description  of  the  proposed  optical  clock  distribution 
system.  An  overview  of  the  clock  distribu'  ’  'n  system  is  presented  first,  followed 
by  a  description  of  the  goals  for  the  system  design  and  implementation  and  a 
characterization  of  design  trade-offs.  The  presentation  includes  two  examples 
for  division  of  the  chip  surface  area  into  functional  blocks  along  with  suggested 
layout  topologies  within  the  cell  for  the  optical  receivers  and  the  local  polysilicon 
distribution  system.  The  optical  receiver  circuit  for  each  functional  cell  is  expected 
to  be  one  of  those  presented  in  Chapter  3  o*jChapter  4. 

A  description  and  analysis  of  a  standard  transimpedance  optical  receiver  is 
given  in  Chapter  3.  This  receiver  is  composed  of  analog  amplifer  stages  and  a 
digital  phase  divider  circuit.  Since  a  key  source  of  clock  skew  for  a  multiple 
receiver  chip  is  the  difference  in  response  times  of  identical  receivers  on  a  given 
chip,  an  extensive  analysis  is  included  which  characterizes  this  variation  as  a 
function  of  fabrication-related  parameter  variation  in  the  transistors  on  the  chip. 
A  second  source  of  timing  uncertainty  is  phase  jitter.  A  noise  analysis  of  the 
transimpedance  amplifier  is  presented  along  with  an  example  using  typical  device 
parameters  for  a  3-micron  CMOS  design. 

Chapter  4  is  a  description  and  analysis  of  a  phase-locked  loop  optical  receiver. 
The  elements  of  the  phase-locked  loop  are  described,  including  a  voltage  con¬ 
trolled  ring  oscillator  and  a  novel  biasing  method  for  the  photodiode  to  allow 
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phase  detection  without  requiring  a  separate  multiplier.  A  phase  jitter  analysis  is 
also  provided  for  the  phase-locked  loop  receiver,  followed  by  a  comparison  of  the 
incident  optical  power  levels  required  for  the  transimpedance  and  phase-locked 
loop  receivers  to  achieve  system  timing  rmcertainty  specifications. 

The  design  of  a  CMOS  chip  to  test  receiver  and  photodiode  performance  pa¬ 
rameters  is  presented  in  Chapter  5.  Two  tests  involve  measurement  of  the  max¬ 
imum  operating  frequency  and  response  skew  of  a  transimpedance  amplifier  re¬ 
ceiver.  A  third  test  involves  the  measurement  of  leakage  currents  between  the 
substrate  imder  a  photodiode  and  nearby  dynamic  memory  devices.  The  specific 
circuit  and  measurement  procedure  are  presented  for  each. 

Finally,  Chapter  6  contains  a  summary  of  contributions  represented  by  the 
work  that  has  been  presented.  In  this  chapter,  suggestions  are  made  for  future 
investigation  in  the  area  of  optical  clock  distribution  systems. 


Chapter  2 

Optical  Clock  Distribution 
System  Design 


In  this  chapter,  the  proposed  optical  clock  system  is  presented.  A  system  overview 
is  given  first,  followed  by  a  description  of  the  goals  for  the  system  design  and 
implementation,  and  a  characterization  of  design  trade-offs.  The  presentation 
includes  two  examples  for  organization  of  the  chip  surface  area  into  fimctional 
cells  along  with  suggested  layout  topologies  within  the  cell  for  the  optical  receivers 
and  the  local  polysilicon  distribution  system.  The  optical  receiver  circuit  for  each 
functional  cell  is  expected  to  be  one  of  those  presented  in  Chapter  3  or  Chapter 
4. 

2.1  System  Overview 

An  overview  of  the  proposed  optical  clock  distribution  system  is  shown  in  Figure  2. 
The  optical  clock  signal  is  generated  by  an  off-chip  light  source,  as  shown  at  the  top 
of  the  figure.  The  light  source  might  be  a  directly  modulated  laser  diode  or  LED, 
or  an  externally  modulated  continuous  wave  laser.  The  optical  beam  is  mapped 
through  a  focusing  element  to  various  photodetector  sites  on  the  chip  surface. 
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Figuie  2.  Optical  clock  distribution  system  overview. 


2.1.  SYSTEM  OVERVIEW 


11 


The  focusing  element  is  expected  to  be  a  hologram  or  lenslet  array,  depending  on 
the  wavelength  and  coherence  of  the  optical  source  chosen.  The  photodiodes  in 
Figure  2  axe  shown  in  a  regular  pattern  for  simplicity.  In  practice,  the  location  of 
the  photodetectors  is  random,  as  determined  by  the  modular  design  of  the  chip 
layout. 

One  of  the  advantages  of  this  optical  clock  distribution  system  is  that  it  allows 
for  modtilar  hierarchical  layout  design  for  the  electronic  circuits  on  the  chip.  In  the 
system,  the  layout  design  of  the  chip  is  such  that  surface  area  is  divided  into  small 
regions  which  we  lefer  to  as  fimctional  cells.  The  dimensions  of  the  functional 
cell  are  determined  by  the  maximum  length  of  polysilicon  wire  for  which  signal 
propagation  delay  is  acceptable  for  the  given  system  operation  requirements.  Each 
such  functional  cell  contains  a  photoreceiver  for  the  detection  of  the  optically 
distributed  clock  signal.  Each  photoreceiver  includes  the  necessary  electronics  to 
create  a  digital  two-phase  clock  from  the  single-phase  optical  input.  The  two-phase 
electronic  clock  is  distributed  via  polysilicon  wires  to  clocked  devices  within  each 
functional  cell.  The  use  of  polysilicon  as  a  local  distribution  medium  minimizes 
the  requirements  for  aluminum  in  clock  signal  routing. 

The  optical  clock  distribution  system  that  is  presented  represents  a  two  level 
hierarchical  distribution  in  which  the  top  level  of  distribution  is  realized  optically, 
and  the  bottom  level  is  implemented  via  polysilicon  wires.  The  characteristic 
communication  delay  for  the  optical  path  between  the  light  source  and  the  pho¬ 
todetectors  on  the  surface  of  the  chip  is  negligible  when  compared  to  the  switching 
time  of  a  typical  electronic  device  on  the  chip.  Since  the  length  of  the  longest 
polysilicon  wire  is  required  to  represent  a  negligible  transmission  line  delay,  the 
clock  skew  for  the  system  is  represented  by  the  differences  in  response  times  for 
the  various  photodiodes  and  amplifiers  that  are  distributed  over  the  surface  of  the 
chip.  If  receiver  designs  can  be  found  in  which  this  type  of  skew  is  appreciably  less 
than  that  for  a  wired  distribution  system  for  the  same  size  chip,  then  the  optical 
system  represents  an  improvement  over  existing  clock  routing  methods. 


12 


CHAPTER  2.  OPTICAL  CLOCK  DISTRIBUTION  SYSTEM  DESIGN 


Functional  ceil 


Clocked  device  - 
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Figure  3:  Chip  layout  for  centered  local  clock  distribution. 


The  layout  organization  of  the  functional  cell  can  be  adapted  for  the  applica¬ 
tion  expected.  One  organization  involves  placing  the  photoreceiver  in  the  center 
of  the  functional  cell,  and  distributing  the  clock  signals  in  all  directions  to  the  sur¬ 
rounding  devices.  This  approach  is  depicted  in  Figure  3.  The  inset  shown  in  the 
upper  right  corner  illustrates  the  organization  of  the  functional  cell  in  which  the 
photoreceiver  and  clock  driver  are  centered  in  the  cell.  The  smaller  squares  sur¬ 
rounding  the  clock  driver  represent  typical  clocked  devices,  such  as  shift  register 
stages. 

A  second  organization  might  be  more  appropriate  for  dataflow  layouts  in  which 
the  clocked  devices  are  arranged  in  lines  for  bit  slice  operations.  For  this  type  of 
application,  the  photoreceiver  and  clock  driver  can  be  designed  to  fit  in  a  long 
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Functional  cell 


Figure  4:  Chip  layout  for  dataflow  organization. 

rectangular  box,  with  the  clock  signals  being  distributed  perpendicular  to  the 
length  of  the  receiver.  This  design  approach  is  shown  in  Figure  4.  Again,  the 
division  of  the  chip  into  functional  cells  is  shown  in  the  lower  left  comer,  while 
the  organization  of  the  functional  cell  is  given  in  the  inset  at  the  upper  right.  The 
long  box  represents  the  photoreceiver  and  clock  driver  circuit,  while  the  small 
squares  represent  the  clocked  elements  in  the  dataflow  path. 

Two  receiver  designs  have  been  developed  which  are  compatible  with  optical 
detection  and  MOS  technology.  One  design  is  a  transimpedance  amplifier;  a 
detailed  description  of  the  design  and  operation  of  this  receiver  is  presented  in 
Chapter  3.  The  second  receiver  design  is  bcised  on  a  phjise-locked  loop  with  a 
voltage  controlled  ring  oscillator.  A  description  of  this  circuit  is  given  in  Chapter 
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2.2  Goals  for  system  design  and  implementation 

The  design  and  implementation  goals  for  the  optical  clock  distribution  system  are 
primeirily  concerned  with  using  existing  technology  whenever  possible.  In  many 
cases  there  are  conflicts  that  arise  due  to  the  difference  in  optical  and  electronic 
technologies  and  materials.  The  trade-offs  that  result  from  such  conflicts  are 
presented  in  the  next  section. 

One  of  the  principle  design  goals  is  to  apply  existing  MOS  technology  for  the 
photoreceiver  circuit  elements.  This  excludes  the  possibility  of  p-i-n  photodiodes, 
because  an  intrinsic  layer  cannot  be  formed  in  standard  MOS  fabrication  pro¬ 
cesses.  Customized  non-silicon  photodiodes  are  also  excluded,  because  packaging 
such  devices  would  require  hybrid  mounting  and  bonding  technology. 

In  standard  MOS  fabrication  processes,  the  depth  of  the  dreiin  and  source 
junctions  is  approximately  0.5  microns  or  less.  The  jimction  depth  can  affect  the 
efficiency  of  a  photodiode  as  well  as  the  response  time.  Design  trade-offs  caused  by 
fabrication  contrzdnts  on  photodiodes  are  discussed  in  further  detail  in  Section  2.3. 

A  second  goal  for  the  optical  receiver  design  is  to  use  a  photodiode  bias  within 
the  range  of  supply  voltages  for  the  existing  circuits  on  the  chip.  Since  standard 
VLSI  designs  have  supply  voltages  of  0  and  5  volts,  the  photodiode  bias  must 
lie  in  this  range.  This  limits  the  width  for  the  photodiode  depletion  layer,  and 
ultimately  the  efficiency  and  response  time  of  the  device.  Again,  the  consequences 
of  this  design  choice  are  discussed  in  detail  in  the  next  section. 

Another  principle  design  goal  is  to  use  existing  optical  receiver  designs  if  pos¬ 
sible.  Optical  commimications  technology  has  developed  good  integrated  receiver 
designs,  some  being  implemented  in  MOS  technology.  An  example  of  a  MOS  op¬ 
tical  transimpedance  receiver  is  given  in  [26].  The  receiver  described  in  Chapter  3 
is  based  on  this  design.  Other  optical  receivers  based  on  phase-locked  loops  have 
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been  reported  in  the  literature[21,27].  The  receiver  described  in  Chapter  4  has 
been  adapted  from  these  designs. 

A  fourth  goal  for  the  optical  clock  distribution  system  is  to  use  existing  optical 
sotirces.  A  variety  of  sources  are  available  from  optical  commvmication  technol¬ 
ogy.  The  sources  commonly  used  in  optical  fiber  systems  range  from  infrared  laser 
diodes  to  visible  LEDs.  Many  of  the  laser  diode  sources  emit  light  at  wavelength 
longer  than  the  bandgap  for  silicon  and  are  tmusable  for  clock  distribution;  how¬ 
ever,  several  inexpensive  laser  diodes  emitting  at  wavelengths  near  0.8  microns 
have  been  developed  for  optical  disk  storage  systems.  Another  alternative  could 
be  using  a  visible  wavelength  laser  and  an  acousto-optical  modulator  as  the  op¬ 
tical  clock  source.  This  approach  seems  applicable  to  Icirge  systems  in  which  a 
single  optical  clock  is  distributed  to  several  VLSI  chips. 


2.3  Design  Trade-Offs 

The  principle  system  design  trade-offs  involve  the  physics  of  photodetection,  the 
wavelength  of  the  optical  signal,  the  choice  of  optical  source,  and  the  choice  of 
focusing  element  for  the  mapping  of  the  optical  signal  to  the  photodetectors.  A 
description  of  the  relationship  between  these  eispects  of  the  optical  clock  distribu¬ 
tion  system  is  presented  in  this  section. 

In  a  reverse-bicised  p-n  junction  diode,  an  electric  field  is  maintained  across 
the  depletion  layer.  Carriers  generated  in  the  depletion  layer  drift  in  the  presence 
of  this  field  across  the  diode  junction  to  form  a  current,  commonly  called  the 
drift  current.  Carriers  generated  outside  the  depletion  layer  diffuse  from  higher 
concentrations  to  lower  concentrations.  Since  the  depletion  layer  edge  represents 
a  lower  concentration  of  minority  carriers,  minority  carriers  generated  outside 
the  depletion  layer  statistically  tend  to  diffuse  toward  the  depletion  layer.  If  the 
carriers  reach  the  edge  of  the  depletion  layer  before  recombining,  then  they  are 
swept  across  the  diode  junction  to  form  a  current,  commonly  called  the  diffusion 
current. 
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The  random  nature  of  the  diffusion  process  causes  it  to  be  a  much  slower  mech¬ 
anism  than  drift.  For  this  reason,  the  drift  current  has  a  much  faster  response  than 
tilt  diffusion  current.  For  faso  photodetection,  nearly  all  of  the  photons  should  be 
absorbed  in  the  depletion  layer,  with  the  resulting  photocurrent  being  primarily 
a  drift  current.  For  high  sensitivity  without  a  fast  response,  simply  requiring  that 
nearly  all  photons  are  absorbed  with  the  characteristic  recombination  length  of 
the  depletion  edge  is  sufficient  [28]. 

The  designer  of  a  photodetector  generally  has  the  option  of  applying  a  large 
enough  reverse  bias  to  adjust  the  depletion  layer  width  to  match  the  photon 
absorption  length  for  the  wavelength  of  light  that  is  detected.  This  is  where 
a  principle  conffict  between  design  elements  of  the  goals  for  the  optical  clock 
distribution  system  is  observed. 

The  photon  absorption  length  in  silicon  for  red  light  is  around  2-3  microns.  By 
applying  a  reverse  bias  of  5  volts,  the  depletion  layer  width  can  match  this.  If  the 
prefered  source  technology  is  a  laser  diode,  emitting  at  a  near  infrared  wavelength 
of  around  0.8  microns,  then  the  photon  absorption  length  in  silicon  is  in  the  range 
of  10-25  microns.  The  depletion  layer  thickness  can  no  longer  be  increased  to 
match  the  absorption  length  because  there  is  a  maximiun  of  5  volts  available  for 
biasing  the  diode.  The  result  is  slower  response  times  for  the  photodiode  and 
poorer  sensitivity.  In  addition,  many  of  the  photons  that  penetrate  deep  into  the 
silicon  substrate  can  diffuse  to  the  depletion  regions  of  nearby  transistors  rather 
than  diffusing  to  the  photodiode  junction[29].  This  not  only  represents  a  loss  in 
efficiency,  but  also  a  crosstalk  current.  If  the  nearby  transistor  is  an  element  in 
a  dynamic  storage  device,  the  result  could  be  a  memory  erasure  with  every  clock 
cycle. 

The  problems  of  leakage  currents,  poor  sensitivity  and  response  time  can  be 
reduced  by  any  of  three  methods.  One  choice  involves  using  a  laser  diode  source 
emitting  light  at  a  near  infrared  wavelength,  and  modifying  the  recombination 
characteristics  of  the  silicon  near  the  photodiode.  This  involves  a  non-standard 
fabrication  process,  and  one  of  the  design  goals  is  compromised. 
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A  second  choice  is  to  require  a  design  rule  specifying  a  minimum  separation 
length  between  a  photodiode  and  any  other  active  device.  This  approach  requires 
an  increase  in  layout  design  constraints  and  overhead,  and  does  not  solve  the  poor 
response  time  and  sensitivity  problems. 

The  remaining  choices  involve  using  visible  light  for  the  optical  signal.  The 
designer  can  use  a  visible  wavelength  LED  as  the  optical  source,  but  light  emitted 
by  an  LED  is  temporally  and  spatially  incoherent,  and  a  holographic  focusing 
element  requires  coherent  light  for  good  resolution.  A  lenslet  array  can  be  used 
with  the  LED  source,  eliminating  the  requirement  for  coherent  light,  but  the 
lenslet  arrays  that  are  commercially  available  are  fabricated  in  regular  patterns, 
and  the  random  mapping  ability  of  the  focusing  element  is  lost.  If  the  photodiode 
locations  are  constrained  to  a  regular  pattern,  the  flexibility  of  the  cellular  layout 
design  for  the  chip  is  compromised. 

A  final  choice  is  to  use  a  visible  wavelength  coherent  source  such  as  a  He-Ne 
or  argon  ion  laser.  These  lasers  can  be  operated  in  a  continuous  wave  mode,  and 
the  beam  can  be  modulated  externally  by  means  of  an  acousto-optic  modulator. 
In  this  approach,  the  packaging  of  the  system  becomes  less  attractive  than  with 
semiconductor  sources,  but  perhaps  the  higher  optical  output  power  of  the  larger 
laser  can  be  used  to  provide  an  optical  clock  signal  to  an  entire  system  rather 
than  a  single  chip  to  compensate  for  the  added  complexity. 

The  options  of  coherence,  wavelength  and  packaging  of  the  optical  source  for 
the  clock  distribution  system  presented  allow  the  designer  to  choose  the  source 
that  best  fits  the  specifications  of  the  VLSI  system  with  which  it  will  be  imple¬ 
mented.  The  trade-offs  that  have  been  presented  are  characteristic  of  any  optical 
clock  distribution  system  in  which  the  design  goals  of  Section  2.2  apply,  regardless 
of  the  optical  receiver  design.  The  specific  receiver  designs  that  are  anticipated 
for  use  in  an  optical  clock  distribution  system  are  presented  in  Chapter  3  and 
Chapter  4  and  accomodate  any  of  the  optical  source  or  focusing  element  choices 
that  have  been  presented  here. 
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Chapter  3 


Transimpedance  Receiver  and 
Clock  Driver 


This  chapter  is  a  presentation  of  the  design  and  operating  parameters  of  a  simple 
transimpedance  optical  receiver  and  clock  driver.  The  receiver  and  two  phase 
clock  drivers  circuits  are  both  CMOS  adaptations  of  NMOS  designs  that  have 
appeared  in  the  literature[26,30].  Descriptions  of  the  receiver  and  clock  driver 
circuits  are  given  in  Section  3.1. 

The  timing  uncertainty  in  a  distribution  system  composed  of  several  receiver 
circuits  consists  of  two  parts.  One  type  of  uncertainty  is  static,  i.  e.  it  is  fixed  for 
a  given  chip  and  varies  for  different  issues  of  the  chip.  This  is  the  skew  due  to 
differences  in  receiver  response  times  caused  by  fabrication-related  variations  in 
transistor  channel  dimensions  and  threshold  voltages  on  a  given  chip.  A  second 
uncertainty  is  temporal  and  is  characterized  by  phase  variations  due  to  noise  in 
the  receiver  circuit.  A  computer  simxilation  of  the  static  timing  skew  for  the  trans¬ 
impedance  receiver  and  clock  driver  is  presented  in  Section  3.2,  and  an  analysis 
of  the  clock  signal  phase  noise  is  given  in  Section  3.3. 


19 


20  CHAPTER  3.  TRANSIMPEDANCE  RECEIVER  AND  CLOCK  DRIVER 


3.1  Circuit  Descriptions 


A  circuit  diagram  for  a  transimpedance  optical  receiver  and  clock  driver  is  shown 
in  Figure  5.  The  devices  shown  to  the  left  of  A  in  the  figure  represent  a  trans¬ 
impedance  receiver  with  analog  inverting  buffer  stages  to  amplify  the  detected 
optical  signal  to  voltage  levels  suitable  for  input  to  digital  CMOS  logic  devices. 
The  design  and  operation  of  this  receiver  are  presented  in  Section  3.1.1.  This 
design  is  very  simple  and  is  intended  to  demonstrate  the  performance  nature  of 
this  type  of  receiver,  rather  than  reflect  an  optimized  circuit.  The  circuit  elements 
to  the  right  of  A'  represent  a  standard  VLSI  two-phase  clock  driver  circuit.  The 
details  of  the  driver  circuit  are  presented  in  Section  3.1.2  with  a  brief  introduction 
to  two-phase  clock  systems  for  VLSI. 
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Figure  6:  Analog  CMOS  inverting  voltage  amplifier  stage. 

3.1.1  Transimpedance  Receiver 

The  transimpedance  receiver  that  is  shown  in  Figure  5  has  been  adapted  for 
CMOS  from  a  design  published  by  Abidi[26].  The  receiver  circuit  is  composed  of 
analog  amplifier  stages.  Each  stage  contains  an  NMOS  pull-down  transistor  and  a 
pcissive  PMOS  pull-up  resistor,  as  shown  in  Figure  6.  The  first  stage  has  a  resistive 
feedback  path,  as  shown  in  Figure  5,  which  allows  the  photocurrent  to  modulate 
the  gate  voltage  of  the  NMOS  transistor  of  the  first  stage.  The  output  of  the  first 
stage  is  modulated  accordingly,  and  is  the  input  voltage  for  the  next  inverter.  The 
ratio  of  pull-up  to  pull-down  channel  length  dimensions  for  all  stages  in  the  chain 
is  constant  and  determined  by  the  biasing  requirements  of  the  receiver. 

In  addition  to  providing  a  transimpedance  path,  the  resistive  feedback  also 
serves  to  self-align  the  dark  state  operating  points  of  the  inverter  stages  in  the 
chain.  When  no  photocurrent  is  present,  each  inverter  stage  rests  at  a  steady  state 
operating  point.  Since  there  is  no  current  flowing  through  the  feedback  resistor, 
the  output  voltage  of  the  first  stage  is  equal  to  the  input  voltage,  and  the  inverter 
is  held  in  equilibrium.  Because  the  stages  are  designed  to  have  similar  dimensions 
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and  the  gain  per  stage  is  small,  this  equilibrium  point  is  identical  for  all  stages  in 
the  chain;  therefore,  when  the  input  stage  retvirns  to  equilibrium,  all  subsequent 
stages  follow. 

The  photodetection  and  amplification  can  be  described  in  terms  of  the  mod¬ 
ulation  of  the  amplifier  chain  voltage  between  the  dark  state  equilibrium  and  a 
lighted  state  level.  When  a  photocurrent  is  present,  the  output  voltage  of  the  first 
stage  is  driven  above  the  dark  stage  bias  point,  and  the  subsequent  stages  each 
amplify  and  invert  this  signal.  When  the  light  is  removed,  the  output  voltage 
of  each  stage  retvums  to  the  dark  state  equilibrium  point.  Since  the  output  of 
the  amplifier  chain  is  required  to  provide  reliable  TTL  input  voltage  levels  for 
the  two-phase  clock  driver  circuit,  the  dzirk  state  equilibrium  voltage  of  the  chain 
must  be  small  enough  to  be  reliably  detected  as  a  low  logic  level,  and  the  lighted 
state  output  voltage  of  the  last  stage  must  be  large  enough  to  represent  a  high 
TTL  input  level.  The  choice  of  TTL  voltage  levels  is  a  consequence  of  the  design 
constraint  to  apply  the  optical  clock  distribution  approach  to  standard  CMOS 
VLSI  systems. 

The  bias  voltage  also  determines  the  gain  for  each  inverter  stage  in  that  it 
determines  the  transconductance  for  each  transistor.  The  open  loop  gain  for  a 
CMOS  inverting  amplifier  stage  is  given  as 
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where  g^i  and  are  the  transconductances  of  the  NMOS  and  PMOS  transistors, 
respectively.  Ip  is  the  drain  current,  Vg5i  and  Vg52  are  the  gate-to-source  voltages, 
and  Vti  and  Vt2  are  the  threshold  voltages[31]. 

A  third  amplifier  design  parameter  that  is  affected  by  the  bias  of  the  transistors 
in  the  inverter  stages  is  the  maximum  rate  at  which  the  output  voltage  of  a  given 
stage  can  make  a  transition  from  one  voltage  level  to  another.  The  is  commonly 
called  the  slew  rate,  and  often  it  determines  the  maximum  operating  frequency 
of  a  receiver.  The  relationship  between  the  slew  rate  and  the  bizis  voltage  for  a 
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single  inverter  is  given  as 

SR  =  {Vcs  —  Vf)u;i  (2) 

where  Vcs  and  Vj  are  the  gate  and  threshold  voltages  of  the  pnll-down  transistor 
and  uji  is  the  unity  gain  frequency  of  the  amplifier[32]. 

A  comparison  of  Equation  1  and  Equation  2  shows  that  a  trade-off  exists  in 
determining  the  bias  voltage.  To  maximize  the  open  loop  gain  for  each  stage, 
the  designer  prefers  a  bias  in  which  the  gate  voltage  of  the  pulldown  transistor  is 
very  near  the  threshold  voltage.  An  amplifier  stage  with  an  optimum  slew  rate, 
however,  should  be  designed  so  that  the  gate  voltage  is  much  higher  than  the 
threshold.  For  the  circuit  simulations  presented  in  Section  3.2  and  the  circuits 
used  in  the  test  chip  described  in  Chapter  5,  a  bias  voltage  has  been  chosen  which 
permits  a  high  slew  rate  at  the  expense  of  very  low  gain  per  stage.  To  some 
extent,  the  low  gain  per  stage  can  be  compensated  by  an  increase  in  the  number 
of  stages  in  the  amplifier  chain.  The  gain  per  stage  for  these  circuits  is  around  2, 
while  the  receiver  slew  rate  is  sufficient  to  amplify  67  MHz  input  signals  to  TTL 
voltage  levels  when  transistor  channel  lengths  are  2  microns. 


3.1.2  Two-Phase  Clock  Driver 

The  output  of  the  transimpedence  receiver  that  is  presented  in  the  previous  section 
is  a  single  square  wave  signal.  While  a  single-phase  clock  can  be  used  to  synchro¬ 
nize  operations  for  computing,  VLSI  designers  generally  use  two  clock  signals  with 
mutually  exclusive  active  phases.  An  introduction  to  timing  and  synchronization 
with  two-phase  clock  signals  and  a  simple  circuit  that  can  be  used  to  synthesize 
the  two  phases  from  a  single  square  wave  are  presented  in  this  section.  A  more 
thorough  description  of  timing  for  VLSI  is  found  in  Ref.  [33]. 

In  synchronous  logic  systems,  the  clock  signal  serves  to  maintain  the  order 
of  occurance  of  events  as  well  zis  defining  the  time  frame  in  which  combinatorial 
logic  may  be  performed.  During  execution  of  a  sequence  of  events,  the  clock  signal 
controls  an  instruction  counter  which  determines  which  operations  are  performed 
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Figure  7:  A  finite  state  machine  with  a  single-phase  clock. 
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Figure  8:  Timing  for  a  single-phase  clock. 

on  each  clock  pulse.  In  this  manner,  the  execution  speed  of  the  procef  3or  is  limited 
by  the  clock  signal  frequency.  In  addition,  the  width  of  each  clock  signal  pulse 
defines  the  time  frame  in  which  a  given  set  of  instructions  must  be  executed. 
The  clock  period  must  be  long  enough  to  guarantee  that  all  of  the  required  logic 
operations  have  settled  to  a  steady  state  before  the  next  pulse  occurs. 

The  advantages  of  a  two-phase  clocking  scheme  are  most  easily  demonstrated 
by  first  considering  the  operation  of  a  finite  state  machine  with  a  single-phase 
clock.  Such  a  system  is  shown  in  Figure  7  and  the  corresponding  clock  signal  is 
shown  in  Figure  8.  When  a  positive  clock  signal  is  applied  to  the  pass  transistor, 
the  next-state  output  voltage  levels  are  transfered  to  the  present-state  inputs.  The 
combinatorial  logic  operations  begin  as  soon  as  the  input  voltages  are  presented, 
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Figure  9:  A  finite  state  machine  with  a  two-phase  clock. 


Maximum  logic  delay 
—  Clock  period 


Figure  10:  Timing  for  a  two-phase  clock. 

and  if  the  clock  pulse  is  too  wide,  the  output  voltages  due  to  the  combinatorial 
logic  execution  can  be  reapplied  to  the  inputs  via  the  pass  transistor.  Since  the 
next-state  outputs  become  active  inputs  to  the  present  state  combinatorial  logic 
devices  before  they  are  expected,  errors  in  processing  occur.  For  reliable  operation, 
the  combinatorial  logic  delay  must  be  longer  than  the  width  of  the  clock  pulse, 
yet  shorter  than  the  clock  period. 

The  two-sided  design  constraint  that  is  present  in  single-phcise  clock  systems 
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cein  be  reduced  to  a  one-sided  constraint  by  the  use  of  two  clock  signals.  This 
approach  is  commonly  called  two-phase  clocking,  and  an  example  of  a  finite  state 
system  with  a  two-phase  clock  is  shown  in  Figure  9.  A  timing  diagram  for  the 
two-phases  of  the  clock  is  shown  in  Figure  10. 

An  important  relationship  between  the  two  clock  phases,  (f>i  and  (j>2,  shown  in 
Figure  10,  is  that  when  one  is  positive  the  other  must  be  zero.  In  the  timing  of  this 
system,  there  are  four  time  intervals:  an  interval  in  which  only  <f>i  is  positive;  an 
interval  in  which  (f>i  returns  to  zero,  allowing  4>2  to  become  positive;  an  interval  in 
which  only  (f>2  is  positive;  and  an  interval  in  which  (j>2  returns  to  zero,  allowing  (^i 
to  become  positive.  Dtiring  the  positive  interval  of  4)\ ,  previously  stored  data  are 
transfered  from  intermediate  buffers  via  pass  transistors  to  present-state  inputs. 
4>i  must  remain  positive  long  enough  to  charge  the  input  capacitance  through 
the  pass  transistor.  After  charging  the  input  capacitance,  combinatorial  logic  is 
executed  and  output  voltage  levels  reach  a  steady  state.  The  width  of  the  pulse 
for  4)1  can  be  much  longer  than  the  execution  time  of  the  combinatorial  logic, 
because  the  feedback  path  is  blocked  by  the  zero  veilue  of  4>2-  The  transition  of 

to  zero  can  occur  at  any  time  after  the  inputs  have  been  charged,  because 
the  input  voltage  levels  are  maintained  once  the  pass  trzmsistor  is  turned  off. 
4>2  may  become  positive  at  anytime  after  4>i  has  become  zero,  however  it  must 
remain  positive  long  enough  for  the  combinatorial  logic  to  be  completed  and  the 
output  voltage  levels  for  the  next  state  to  be  transfered  to  the  intermediate  buffer 
by  means  of  the  second  pass  transitor.  Since  4>i  and  4>2  are  not  allowed  to  be 
positive  at  the  same  time,  there  is  no  continuous  feedback  path;  therefore,  the 
only  timing  constraint  is  that  the  clock  period  must  be  sufficiently  long  to  allow 
the  combinatorial  logic  to  be  completed,  along  with  a  nominal  delay  for  charging 
the  present-state  inputs  and  a  preset  time  to  insure  stable  next-state  outputs. 

It  should  be  noted  that  for  CMOS  circuits,  the  pass  devices  are  generally 
implemented  as  a  PMOS  and  NMOS  transistor  connected  in  parallel,  eis  shown  in 
Figure  11.  In  this  configuration,  the  control  signal  for  the  PMOS  transistor  is  the 
complement  of  the  control  for  the  NMOS  transistor.  Four  clock  signals  must  be 
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Figure  12:  A  two-phase  clock  synthesis  circuit, 
distributed  for  CMOS  VLSI  systems,  eg.  <j)i,  <f>2  and  the  complements  of  <f>i  and 

(f>2. 

Circuits  which  synthesize  two-phase  clock  signals  from  a  single  square  wave 
input  are  well  imderstood.  An  example  of  such  a  circuit  is  shown  in  Figure  12. 
This  clock  driver  is  modeled  after  those  discussed  in  Ref.  [33].  The  condition 
of  mutually  exclusive  positive  clock  phases  is  achieved  by  using  each  clock  phase 
as  an  input  to  a  nand  gate  to  generate  the  other  clock  phase.  In  this  manner, 
the  positive  phase  of  one  clock  signal  is  dependent  on  the  zero  level  of  the  other. 
The  buffers  following  the  nand  gates  in  Figure  12  provide  the  drive  capability 
to  distribute  the  signal  to  the  large  capacitive  load.  The  complementary  clock 
phases  are  present  at  the  inputs  to  the  last  buffer  stage  for  each  clock  signal. 
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3.2  Static  Receiver  Response  Skew  Simulations 


In  a  distribution  system  consisting  of  several  receiver-driver  circuits  on  a  given 
chip,  the  signal  skew  can  be  represented  as  the  difference  in  response  times  for 
the  copies  of  these  circuits  on  the  chip.  The  designer  can  minimize  the  response 
skew  effect  by  using  identical  layouts  for  the  receiver-drivers;  however,  the  actual 
transistor  parameters  can  vary  from  one  copy  to  another  on  a  given  chip  due  to  the 
nature  of  the  fabrication  process.  The  fabrication-induced  parameter  variations 
cause  differences  in  response  times  for  identically  drawn  circuits  and  can  be  a 
major  component  in  signal  skew,  especially  for  analog  MOS  circuits.  The  circuit 
simulation  study  that  is  presented  in  this  section  has  been  performed  to  determine 
the  typical  receiver-driver  response  skew  resulting  from  reasonable  variations  in 
transistor  channel  dimensions  and  threshold  voltages. 

A  SPICE  simulation  of  the  output  signal  for  the  simple  optical  clock  detector 
and  driver  presented  in  Section  3.1  is  shown  in  Figure  13.  Here,  the  two  clock 
phases  are  shown  for  an  example  with  the  nominal  transistor  threshold  voltage 
and  channel  dimension  values.  The  simulation  reflects  a  layout  design  in  which 
the  nominal  transistor  channel  dimension  is  2  microns.  The  nominal  width  of  each 
output  buffer  stage  is  320  microns.  The  width  of  the  input  pull-down  transistor 
for  the  transimpedance  amplifier  is  1380  microns  and  represents  the  largest  device 
dimension  in  the  circuit.  As  noted  in  Section  3.1,  this  circuit  is  intentionally  de¬ 
signed  to  be  simple  and  device  dimensions  have  not  been  optimized  with  respect 
to  trade-offs  between  output  response  and  layout  overhead.  The  experiment  is 
intended  to  give  an  indication  of  operation  of  the  general  approach  when  pho¬ 
todetection  and  clock  driver  circuitry  are  confined  to  a  reasonably  small  area  (less 
than  0.003  cm^). 

For  all  simulations,  the  photodetector  is  modeled  as  a  reverse  biased  p-n  junc¬ 
tion  diode  in  parallel  with  an  independent  current  source.  The  current  source 
waveform  is  modeled  as  a  12.5  /lA  peak  current  square  wave  with  linear  transi¬ 
tions  of  10  ns.  The  equivalent  peak  optical  power  incident  on  the  photodetector 


3.2.  STATIC  RECEIVER  RESPONSE  SKEW  SIMULATIONS 


29 


Figure  13:  SPICE  simulation  of  4>i  (solid)  and  <i>2  (dEished)  clock  phases  for  trans- 
impedance  receiver  and  two-phase  clock  synthesizer. 


(for  a  square  wave  with  a  50  per  cent  duty -cycle)  which  is  expected  to  achieve  such 
a  current  level  is  on  the  order  of  -10  dBm  for  visible  wavelengths.  The  capacitive 
load  is  2.5  pF,  and  represents  the  fanout  for  a  typical  functional  cell  described  in 
Section  2.1. 

Parameter  variations  between  transistors  which  are  closely  spaced  on  a  given 
chip  tend  to  be  highly  correlated.  For  example,  if  an  NMOS  transistor  has  a  chan¬ 
nel  dimension  that  is  0.1  /zm  longer  than  the  ideal  value,  all  NMOS  transistors 
in  that  general  area  of  the  chip  surface  tend  to  have  the  same  error.  Parameter 
errors  for  devices  separated  by  large  distances  on  a  given  chip  tend  to  be  more 
independent.  For  the  simulation  studies  presented  here,  all  of  the  transistor  pa¬ 
rameters  for  similar  devices  in  a  single  circuit  simulation  were  varied  the  same 
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Figure  14:  Worst  case  clock  phase  signal  skew  for  transimpedaince  receiver  and 
clock  driver. 

amount.  Transistor  dimension  variations  were  assumed  to  be  ±0.1pm  and  thresh¬ 
old  voltage  variations  to  be  ±1  mV,  reflecting  parameter  fluctuations  that  might 
be  expected  on  a  given  chip  for  a  good  2-micron  fabrication  process.  Channel 
width  variations  in  these  ranges  caused  no  noticeable  changes  in  the  response  of 
the  receiver  or  clock  driver  output.  The  worst  case  output  signal  variations  for 
changes  in  channel  length  and  threshold  voltages  are  shown  in  Figure  14. 

In  Figure  14,  the  signals  represent  the  minimum  and  maximum  throughtput 
delays  for  the  output  of  the  clock  driver  from  point  B  in  the  circuit  diagram  of 
Figure  5.  This  is  the  (f>i  clock  signal.  The  separation  between  curves  represents  the 
receiver  skew  that  is  expected  due  to  fabrication-related  differences  in  transistor 
parameters.  The  transition  time  for  each  signal  is  around  1  ns,  and  the  maximum 
response  skew  is  around  3  ns.  These  values  are  representative  of  a  ver\'  simple 
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Figure  15:  (solid)  and  <^2  (dashed)  receiver-driver  clock  phases  response  for  67 

MHz  optical  input  signal. 

photoreceiver  and  clock  driver  circuit  which  has  not  been  optimized  with  respect 
to  minimizing  receiver  skew. 

In  another  series  of  simulations,  the  input  optical  clock  frequency  weis  varied 
to  determine  a  maximum  operating  frequency  for  the  circuit.  This  simulation 
assumes  a  circuit  which  is  not  limited  by  photodiode  carrier  diffusion  delays,  but 
rather  provides  an  indication  of  the  maximum  frequency  of  the  amplifier  and  clock 
driver  electronics.  The  receiver  and  clock  driver  were  found  in  the  simtilations 
to  produce  valid  digital  output  levels  for  the  two  clock  phases  up  to  around  67 
MHz,  cis  shown  in  Figure  15;  however,  for  reliable  clock  distribution,  the  expected 
receiver  skew  must  be  accounted  for.  This  allows  clock  frequencies  of  up  to  50 
MHz,  beised  on  the  simple  circuit  used  in  these  simulations.  A  more  sophisticated 
circuit  design  might  easily  achieve  a  higher  clock  rate. 
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3.3  Signal  Jitter  Analysis 

The  static  receiver  response  skew  that  has  been  presented  represents  only  one  type 
of  timing  uncertainty  that  is  inherent  in  a  synchronous  system  with  distributed 
clock  receivers  and  drivers.  For  the  static  timing  uncertainty,  each  receiver  has  a 
specific  response  delay,  and  the  variation  of  delays  between  receivers  results  in  a 
signal  skew.  A  second  type  of  timing  uncertainty  involves  the  temporal  variation 
in  the  transitions  of  the  clock  signal  at  each  receiver  due  to  noise  in  the  system. 
This  type  of  timing  uncertainty  is  commonly  called  phase  jitter.  This  section  is  a 
presentation  of  the  phase  noise  analysis  of  the  transimpedance  receiver  described 
in  Section  3.1. 

Since  phase  noise  is  stochastic  in  natiu'e,  phase  jitter  is  commonly  expressed  in 
mean  square  units  of  radians^  or  degrees^.  For  purposes  of  comparing  the  stability 
of  clock  signals,  it  is  sometimes  more  helpful  to  express  the  jitter  in  terms  of  time 
units  rather  than  phase  units.  This  is  especially  useful  when  comparing  receivers 
that  have  different  operating  frequencies,  as  is  done  in  Chapter  4.  The  analysis 
in  this  section  will  be  first  expressed  in  terms  of  phase  units,  then  converted  to 
time  units  for  a  typical  frequency  of  operation  that  is  expected. 

In  order  to  characterize  the  phase  noise  present  at  the  output  of  the  receiver, 
a  signal'to-noise  analysis  technique  for  phase  and  frequency  modulated  systems 
from  communication  theory  is  employed[34].  The  input  signal  is  approximated  by 
a  sinusoidal  waveform  to  simplify  the  analysis.  The  signal  and  noise  currents  at 
the  receiver  input  are  given  by 


=  ^(1  +  cosa;cf) 

(3) 

=  inc{t)  cos ivj  -  in,{t)  sin  UJct 

(4) 

where  Iph  is  the  amplitude  of  the  clock  signal  at  the  input  of  the  detector, 
and  in,  are  the  quadrature  representations  of  the  bandpeiss  noise,  and  oJc  is  the 
frequency  of  the  clock  signal.  From  Equation  3  and  Equation  4,  the  a-c  portion 
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of  the  current  at  the  input  to  the  receiver  can  be  represented  ais 

*in(<)  =  ^  costJct  +  inc(<)  cosu^t  -  sinw,.*  (5) 

=  [(Iph/2)  +  inc(t)]  cosuict  -  in*(t)  sinw^t  (6) 

=  R(t)  cos[u>ct  +  ^(t)]  (7) 

where  R(t)  is  the  time-varying  envelope  and  9{t)  is  the  phase  variation  due  to 
noise,  as  determined  by 

R(t)  =  V'((/,h/2)  +  (8) 

_  _ 1. _  *nj(0  /n^ 


0(t)  =  arctan 


{Iphl^)  +  *nc(0 


In  the  standard  analysis,  as  is  the  case  for  the  receiver  that  has  been  presented, 
it  is  assumed  that  the  received  signal  is  eventually  clipped;  therefore,  the  time- 
varying  envelope  can  be  approximated  by  the  time  invariant  amplitude  of  the 
signal  after  being  limited,  Ai,.  The  output  of  the  limiter  is  then  given  as 

iL(t)  =  Al  cos[uJcf  +  0(t)]  (10) 

and  the  time  variation  is  represented  by  the  phase  term,  9{t).  For  cases  when  the 
signal  amplitude  is  much  larger  than  the  noise  components,  the  phase  noise  can 
be  approximated  as 

0(t)  « - (11) 

The  total  mean  square  noise  is  given  by 
*2  '2 

*n  =  ^(1  +  cos2a;c<)  -|-  -|^(1  -  cos2u;ct)  -|-  2incin*(cosu;c<sina;ct).  (12) 

Since  the  second  harmonics  are  outside  the  paissband  of  the  circuit,  and  the 
quadrature  noise  components  are  independent.  Equation  12  becomes 

+  (13) 

Assuming  that  =  t^,  the  mean  square  phase  noise  from  Equation  11  is 

^  /J,' 


(14) 
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A  circmt  model  for  noise  analysis  of  the  first  stage  amplifier  is  shown  in  Fig¬ 
ure  16.  This  model  represents  the  small  signal  nature  of  the  first  stages  of  the 
receiver  amplifier  chain.  In  this  model,  the  noise  sources  are  independent  of  each 
other,  i'pj  is  the  mean  square  noise  current  at  the  photodetector.  Noise  due  to 
the  optical  source  such  as  relative  intensity  Iciser  noise  can  be  included  here,  and 
the  combination  of  thermal  diode  junction  noise  and  the  source  noise  is  given  as 

^Ipd  =  ^kTB/rj  +  (15) 

where  k  is  Boltzmann’s  constant,  T  is  the  temperature  in  Kelvin,  B  is  the  noise 
bandwidth  of  the  receiver,  and  rj  is  the  reverse  bias  junction  resistance  of  the 
photodiode.  The  shot  noise  portion  of  is  neglected,  as  is  the  1/f  noise.  These 
are  good  assumptions  due  to  the  reverse  bicis  on  the  diode  and  the  high  frequency 
operation,  respectively,  is  the  shot  noise  due  to  leakage  in  the  gate  of  the 
transistor,  represented  by 

ii;=2<z4S  (16) 

where  q  is  the  electron  charge  and  Ig  is  the  gate  leakage  current.  i\j  is  the  thermal 
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noise  due  to  the  resistance  of  the  feedback  path,  calculated  as 


ilj  =  AkTB/Rft. 

(17) 

*ndr  drain-source  shot  noise,  represented  by 

=  4kT(2/3)g^B 

(18) 

where  is  the  transconductance  of  the  pull-down  transistor  for  the  first  stage. 
Again,  1/f  noise  has  been  neglected  in  the  calculation  of  the  channel  noise  current, 
is  the  thermal  noise  due  to  the  load  resistance,  given  by 

(19) 


The  equivalent  mean  square  noise  voltage  at  the  input  to  the  second  stage  is 
determined  by  a  similar  analysis  for  noise  sovuces  in  that  stage,  and  is  given  as 

^  B{2Bk/,  +  «r(2/39„)ll+u.2Bi(C*  +  C,)=l) 

- l+u;={B^C.;)^ - 

This  calculation  assumes  that  the  second  stage  dimensions  are  identical  to  those 
of  the  first  stage;  therefore,  the  load  resistances,  gate  capacitances,  and  trainscon- 
ductances  are  equal. 

For  the  noise  analysis,  the  effects  of  each  source  on  the  output  voltage  of 
the  first  stage  are  determined,  and  an  equivalent  input  current  source  for  each 
is  derived.  Since  and  appear  at  the  input,  there  is  no  need  to  calculate 
equivalent  values  for  them.  The  equivalent  input  noise  due  to  is 

Tj -  —  Rfb[(9mRL  -  (^'^RLCgTL)  +  jujgmRLTr.  "  CgRl)]  ^ 

^neq,f  ^nf  _  i^^GTfTo)  +  ju!{Tf  +  -  gmRfbTL)] 

where  =  RiCdr,  tj  =  R/bCgd,  G  =  {1  +  g^Ri),  and  Ca  =  {Cg  -f-  Cg^  -t-  Cj^). 
The  equivalent  input  noise  due  to  z^^^  is 


Tj —  -^£,{[1  +  "JgmRL  -  U^iGTqTj  -f-  RjbCgTi)]  -f-  j(jo[G{To  +  Tf)  +  RfbCg]} 
i2l,[(l  -  gmRjb  -  (J^GtjTq)  -I-  +  Tl-  gmRjbTL)] 
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where  tq  =  tl/ G.  The  equivalent  input  noise  due  to  is  similar,  and  given  as 


j2  — 

''neq,L 


(23) 


,•2 


+  2gmRL  —  UJ^iGroTf  +  RfbCgTL)]  +  ju)[G(TG  +  Tf)  +  RfbCg]} 


i2z,[(l  -  gmRfb  -  Uj'^GtjTg)  +  joj{Tj  +  ri,  -  gmRjbTL)] 
The  input  equivalent  noise  due  to  the  next  stage  is 


‘'nc^,n 


u^CKI+oj^tJ)  _ 

1  +  (Cai2/6 -t- 


(24) 


The  total  input  mean  square  noise  current  is  the  sum  of  the  independent 
equivalent  input  noise  sources,  or 


,•2  _  ,'2  .1.  :2  4.  ,'2  i  :2  r  -2  .  ,'2 

‘n«9,t  '’neq,pd  '  ‘'neq,g  ‘  ''neq,/  ~  ‘'neq,dT  '  ‘'neq,L  '  ‘'neq,pd' 


(25) 


The  total  equivalent  input  noise  can  then  be  compared  to  the  mean  square  pho¬ 
tocurrent,  and  an  equivalent  input  SNR  can  be  determined. 

An  important  parameter  in  the  quantification  of  the  receiver  noise  is  the  input 
noise  bandwidth  of  the  transimpedance  amplifier.  In  order  to  determine  this 
bandwidth,  the  output  voltage  characteristic  transfer  function  heis  been  developed 
from  circuit  analysis.  The  transfer  function  is  given  as 


where 


and 


Vo/iiu  =  N(s)/D{s)  (26) 


N{s)  =  i2£,[(l  “  dmRjb)  +  (TJ  +  Gtg  —  gmRfbTL)s  +  Gt/Tos'^]  (27) 


D{s)  —  G  +  \G{Tf  Tg  -{•  T£,)  -f  (i2/6  +  ■R£,)Ga]s  A 

[G{tjTg  +  Tl{tj  -I-  Tg))  +  2RfhCaTL  -f  RtCairf  -+-  r£,)]s^  + 
[GtiTjTg  -f  RfbCaXl  +  RLCaTfTL]s^.  (28) 

In  order  to  determine  typical  noise  performance  for  the  transimpedance  re¬ 
ceiver,  and  to  compare  it  with  the  performance  of  a  phase-locked  loop  receiver 
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Symbol 

Value 

Symbol 

Value 

Rl 

2kft 

Ci 

66.3  nF /cm^ 

Rfb 

20kn 

c. 

2  pF 

IMH 

Cfj, 

0.52  pF 

9rn 

0.0134  fi-* 

C,, 

0.54  pF 

10.8  ns 

Cdr 

4.3  pf 

n 

12.6  ns 

B 

35  MHz 

Table  1:  Circuit  element  values. 


found  in  Chapter  4,  the  input  noise  bandwidth  and  circuit  model  element  values 
have  been  calculated,  based  on  measured  parameters  from  the  MOSIS  fabrication 
process  for  SPICE  simulation  input.  These  values  are  shown  in  Table  1,  and  rep¬ 
resent  a  transimpedance  amplifier  implementation  with  3  micron  channel  length 
transistors. 


For  the  example  circuit  parameters  given  in  Table  1,  the  passbands  of  the 
individual  input  equivalent  noise  sources  are  limited  by  the  input  noise  passband. 
The  input  equivalent  noise  currents  for  the  example  can  therefore  be  approximated 
as 


.’2 

'nev,/  ~  *n/ 


j2  -2  [2/2/1, 

*neg,dr  ~  ‘ndr 


T  R 


*ncg,L 


^[2/2/6 


L  J 

n2 


L  R 


L  J 


p  W  — 

«,.n  ^2^ 


(29) 

(30) 

(31) 

(32) 


The  respective  contributions  of  the  individual  noise  sources  and  the  corre¬ 
sponding  mean  square  equivalent  input  noise  current  for  the  circuit  parameters 
shown  in  the  example  are  listed  in  Table  2.  The  total  equivalent  input  mean 
square  noise  current  for  this  receiver  can  be  approximated  as  1.91x10"*®  (A^). 
The  estimated  mean  square  phase  jitter  can  be  determined  from  Equation  14.  For 
the  example  given,  this  is  7.64xl0"®radians^.  The  rms  phase  jitter  is  2.76x10"^ 
radians. 
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Somree 

Mean  Square 
Noise 

Mean  Square  Eq. 
Input  Noise 

(A^) 

mm 

5.76x10“*® 

5.76x10“*® 

1.12x10“^® 

1.12x10“^® 

mm 

2.88x10“*’’ 

2.88x10“*’ 

mm 

5.15x10“*® 

1.03x10“*® 

mm 

2.88x10“*® 

5.77x10“*’ 

3.26x10“’® 

Table  2:  Individual  noise  source  values  and  equivalent  input  roise  values  for  3- 
micron  example. 

In  order  to  compare  the  magnitude  of  the  noise-induced  timing  uncertainty 
for  signals  of  different  frequencies ,  a  conversion  from  rms  phase  jitter  in  units  of 
radians  to  rms  time  jitter  in  units  of  seconds  can  be  performed.  If  the  trans¬ 
impedance  amplifier  frequency  were  not  limited  by  slew  rate,  but  rather  by  the 
noise  bandwidth  of  the  receiver,  maximum  clock  frequency  for  the  example  given 
would  be  around  35  MHz.  This  corresponds  to  an  rms  signal  jitter  of  12.5  ps.  In 
reality,  however,  the  maximum  clock  frequency  is  likely  to  be  limited  by  the  slew 
rate  of  the  amplifier.  In  Chapter  5,  laboratory  measurements  for  test  receivers 
implemented  in  3-micron  CMOS  indicate  that  the  maximum  clock  frequency  for 
reasonable  optical  input  power  is  around  15  MHz.  The  phaise  jitter  for  a  15  MHz 
clock  signal  based  on  the  example  circuit  parameters  corresponds  to  an  rms  signal 
jitter  of  29.3  ps. 

For  the  purpose  of  estimating  the  noise  performance  of  an  implementation  of 
the  transimpedance  amplifier  with  2  micron  channel  length  transistors,  the  noise 
bandwidth  is  scaled  by  a  factor  of  1.5  to  52.5  MHz.  The  equivalent  mean  square 
noise  current  similarly  scales  to  2.86x10“*®  A^,  and  the  mean  square  phase  noise 
becomes  1.15x10“®  radians^  with  an  rms  value  of  3.39x10“®  radians.  For  an 
operating  frequency  of  50  MHz,  the  skew-limited  maximum  operating  frequency 
for  a  2-micron  design  as  simulated  in  Section  3.2,  the  rms  signal  jitter  is  estimated 
as  10  ps. 
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The  noise  contribution  to  timing  uncertainty  for  the  transimpedance  amplifier 
that  has  been  presented  in  this  chapter  is  two  orders  of  magnitude  less  than 
the  static  skew  that  is  expected  for  the  same  circuit.  For  this  reason,  the  jitter 
can  be  neglected  in  determining  the  maximum  operating  frequency  for  the  clock. 
The  skew  contribution,  however,  is  quite  large.  This  receiver  design  provides 
no  real  synchronization  improvement  over  metal  wired  distribution  systems  for 
chip  sizes  less  than  1  cm,  while  requiring  an  inhibitively  lairge  amount  of  layout 
area  for  implementation  of  each  receiver.  While  some  application  might  exist 
for  wafer  scale  clock  distribution  using  a  transimpedance  amplifier  approach,  a 
receiver  design  with  better  performance  and  less  layout  overhead  is  required  for 
single  chip  optical  clock  distribution.  A  phase-locked  loop  receiver  with  picosecond 
static  response  skew  and  sub-picosecond  rms  signal  jitter  is  presented  in  Chapter  4 
which  is  expected  to  require  much  less  chip  area  and  will  operate  for  a  100-200 
MHz  clock  signal. 
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Chapter  4 


Phase-Locked  Loop  Receiver 


The  limitations  of  the  performance  of  the  transimpedance  MOS  receiver  make  it 
inappropriate  for  chip  level  optical  clock  distribution  systems.  The  low  gain,  poor 
response  time,  large  response  skew,  and  massive  layout  area  requirements  indicate 
that  an  alternate  receiver  approach  must  be  used  if  multiple  receivers  are  to  be 
implemented  on  a  single  chip  with  performance  superior  to  existing  metal  clock 
distribution  systems. 

An  attractive  alternative  receiver  is  the  phase-locked  loop  receiver  that  is  pre¬ 
sented  in  this  chapter.  This  receiver  is  based  on  a  voltage  controlled  ring  oscillator, 
which  allows  implementation  with  significantly  lower  transistor  size  requirements, 
while  operating  in  the  100-200  MHz  frequency  range.  A  description  of  this  circuit 
and  its  operation  is  presented  in  Section  4.1. 

As  with  the  transimpedance  amplifier,  the  phase-locked  loop  receiver  is  subject 
to  static  and  temporal  signal  transition  uncertainties.  Section  4.2  is  a  presenta¬ 
tion  of  fabrication-related  response  skew,  while  Section  4.3  is  a  description  of  the 
signal  jitter  due  to  noise  at  each  receiver.  For  each  case,  the  phase-locked  loop 
with  the  same  optical  input  power  represents  orders  of  magnitude  improvement 
over  the  transimpedance  receiver,  as  is  demonstrated  by  examples  of  a  2-micron 
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Phase  detector 


Loop  filter 


Figure  17:  A  phase-locked  loop  block  diagram. 

implementation  of  the  phase-locked  loop  receiver  and  comparisons  with  the  cor¬ 
responding  examples  for  the  transimpedance  receiver  presented  in  Chapter  3.  A 
comparison  of  incident  optical  power  levels  required  for  the  two  receiver  designs 
to  achieve  an  arbitrary  minimum  timing  uncertainty  is  presented  in  Section  4.4. 


4.1  Circuit  Descriptions 

Phase-locked  loop  receivers  operate  by  matching  the  phase  and  frequency  of  a 
locally  generated  signal  with  an  external  waveform.  Traditionally,  a  phase-locking 
circuit  consists  of  a  voltage  controlled  oscillator  (VCO),  a  phase  detector,  and  a 
loop  filter,  as  shown  in  Figure  17. 

The  phase  detection  and  filtering  produce  a  control  voltage  which  is  used  as 
an  input  to  the  voltage  controlled  oscillator  and  ultimately  controls  the  output 
frequency  and  phase  of  the  loop.  The  phase  detector  measures  the  separation 
of  the  input  and  VCO  output  signals.  The  pheise  error  voltage  is  a  function  of 
the  time  separation  between  the  corresponding  pulse  edges  of  the  two  signals. 
The  two  signals  become  phcise-locked  when  the  phase  separation  between  the  two 
signals  becomes  a  contant  value.  By  adjusting  the  control  voltage  for  the  VCO, 
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the  output  frequency  and  phase  are  adjiisted  until  the  separation  between  the 
signals  reaches  a  steady-state  value,  bringing  the  VCO  output  signal  into  lock 
with  the  input  signal. 

There  are  typically  three  input  frequency  ranges  that  can  be  used  to  describe 
the  phase  aquisition  and  tracking  ability  of  phase  locked  loops.  These  ranges 
are  centered  around  the  output  frequency  of  the  VCO.  The  first  two  ranges  de¬ 
scribe  the  dynamic  capability  of  the  loop  to  track  or  reacquire  an  input  signal 
with  modulation.  The  third  describes  the  ability  of  the  loop  to  lock  to  a  single 
frequency. 

The  narrowest  range,  the  lock-in  range  or  Aa;^,,  describes  the  maximum  input 
frequency  shift  for  which  the  loop  can  maintain  phase-lock  between  the  input 
and  VCO  signals.  Any  frequency  steps  less  than  AuJc  for  the  input  signal  can 
be  followed  by  the  VCO  without  losing  phase  lock.  If  the  input  frequency  is 
modulated  outside  this  range,  the  phase  locking  between  the  two  signals  can  slip 
a  few  cycles,  and  perhaps  be  recovered.  This  is  relevant  to  frequency  or  phase 
modulation  receivers,  because  it  defines  the  bandwidth  of  the  input  modulation 
frequency  that  can  be  tracked,  but  it  has  limited  meaning  for  use  with  a  single 
frequency  clock  signal  input. 

The  second  range,  the  pull-in  range  or  Aujp,  describes  input  frequencies  which 
can  be  pulled  in  to  lock  after  slipping  for  a  munber  of  cycles.  Any  frequency  steps 
in  this  reinge  between  the  input  frequency  and  the  VCO  output  can  cause  the 
phzise  to  slip  several  cycles,  but  the  system  will  again  return  to  a  locked  condition 
eventually.  The  time  required  to  return  to  lock  is  dependent  on  the  dynamics  of 
the  phase  locked  loop  and  the  magnitude  of  the  frequency  step.  Again,  for  clock 
operation,  this  parameter  has  limited  meaning,  because  the  input  frequency  for 
the  clock  signal  will  only  be  stepped  when  the  clock  signal  is  turned  on. 

The  widest  range,  the  hold  range  or  Aw//,  describes  the  input  frequency  range 
in  which  a  steady-state  phase  error  exists.  It  is  primarily  defined  by  the  saturation 
limits  of  the  phase  detector.  Input  frequencies  that  differ  from  the  VCO  free- 
rxmning  signal  by  more  than  ±Au>//  have  an  expected  error  voltage  that  excedes 
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Phase  detector 


Figure  18:  A  phcise-locked  loop  for  optical  clock  detection. 

the  output  voltage  available  from  the  phase  detector,  and  therefore,  the  loop  will 
not  lock.  The  VCO  can  lock  to  a  single  frequency  clock  signal  inside  the  hold 
range,  however  changes  in  the  input  frequency  could  result  in  instability  in  which 
the  loop  can  become  momentarily  unlocked,  if  the  changes  are  larger  than  Awp. 

Phase-locking  systems  can  be  designed  with  either  digital  components,  or  as 
analog  circuits.  Digital  phase-locked  loops  have  the  advantage  of  programmability 
and  digital  level  output  signals,  but  operate  at  relatively  low  frequencies.  Analog 
phase-locked  loops  can  operate  at  higher  frequencies,  but  typically  require  large 
devices  for  implementing  the  VCO  and  loop  filters,  while  usually  delivering  a 
sinusoidal  small  signal  output. 

The  receiver  presented  here  is  a  hybrid  of  analog  and  digital  approciches,  mod¬ 
eled  after  those  recently  reported  in  the  literature[21,27].  Chown  in  Figure  IS, 
this  circuit  has  analog  RC  low  pass  filters,  an  analog  phzise  detector,  and  a  volt¬ 
age  controlled  ring  oscillator.  The  ring  oscillator  provides  a  digital  output  level 
signal  which  can  then  be  used  as  an  input  for  the  two-phase  clock  synthesizing 
circuit  described  in  Section  3.1.2,  while  being  implemented  with  minimnm  size 
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Figtire  19:  Timing  diagram  of  input  signal,  VCO  output  signal,  and  photodiode 
output  for  phase  detection. 


transistors. 

The  design  reported  by  Fried  [21]  has  been  modified  by  actively  biasing  a 
photodiode  with  the  VCO  output  signal  to  allow  photodetection  and  phEise  com¬ 
parison  together,  eliminating  the  need  for  a  separate  signal  multiplying  device. 
When  the  output  voltage  of  the  VCO  is  high,  the  photodiode  is  reverse  biased, 
and  a  photocurrent  flows  when  light  is  present  on  the  diode.  When  the  output 
voltage  of  the  VCO  is  low,  the  photodiode  is  forward  biased,  and  any  photocur¬ 
rent  is  negligible  when  compared  to  the  forward  diode  current.  In  this  manner, 
Vd,  the  diode  voltage  indicated  in  Figure  18,  is  modulated  between  three  states, 
ais  shown  in  Figure  19.  The  voltage  levels  for  the  three  states  of  are:  the 

forward  biased  jimction  voltage  for  the  diode,  Vrv,dk^  the  reverse  biased  junction 
voltage  with  no  illumination,  and  Kv.i//,  the  reverse  biased  junction  voltage  with 
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Figure  20:  Phase  detector  conversion  characteristic  as  a  function  of  signal  sepa¬ 
ration,  r. 


illumination,  where 


kT 

•rr  _  ^ddHpd 

"  Rpu  +  Rpd 


Vrv.iU  = 


VddRdd  +  IphP  puRpd 


■  P  _1_  P  ' - ' 

d^pu  I  dXpd 

The  phase  error  signal  for  the  loop  in  this  phase-locked  loop  is  the  DC  term  of 
the  output  signal  from  the  photodiode,  and  is  produced  by  low  pass  filtering  Vj. 

The  characteristic  phcise  detector  output  voltage  from  the  low  pass  filter  is 
shown  in  Figure  20,  and  is  the  time  average  of  the  voltage  on  the  photodiode. 


given  as 


= 


^dd^pd  I  ^ph  ^pd 
Rpu  +  Rpd  Rpu  -^Rpd 


'^pu+Rpd  Rpu+Rp<^ 


+  for0<r<f 


+  P/wj  + 


■rlphUpuRpA  f  L  <r  T  <T 

T(/?p«+Hprf)  2  - 


where  r  is  the  time  separation  between  the  VCO  output  pulse  edge  and  the  optical 
input  signal  pulse  edge.  For  a  square  wave  optical  input  signal,  the  characteristic 
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function  of  the  phase  detector  is  linear,  as  shown,  and  the  conversion  gain,  Jv'j,  is 
given  as  the  slope  of  this  characteristic,  or 


pti 

\N-d\  —  {T/2)  IphReq/T^ 


(37) 


where  i2p„  and  Rj^  are  the  pull-up  and  pull-down  resistors  in  a  voltage  divider, 
Req  is  the  equivalent  parallel  resistance  of  i?pu  and  Rpdi  and  T  is  the  period  of  the 
VCO  output  signal.  The  maximum  and  minimum  values  of  Vj  are  given  cis 


^ddRpd  4 


Rp» 


■Kpd 

i 


+ 


(38) 


and 


respectively. 


V2  =  - 


Vfi^Rpd 

2  [.ftpu  +  Rpd 


fwd 


(39) 


The  operation  of  low  order  phase-locked  loops  is  well  understood  [35,36].  In 
general,  the  order  of  a  phase-locked  loop  is  determined  by  the  Laplace  transform 
of  the  phase  transfer  function  for  the  loop.  Since  the  phase  of  the  VCO  output 
signal  can  be  represented  as  the  integral  of  the  frequency  of  that  signal,  the  transfer 
function  for  the  VCO  block  in  the  loop  can  be  represented  as 


Hy^o{s)  =  K„ls  (40) 

where  Kg  is  the  voltage-to-frequency  conversion  gain  of  the  VCO.  If  the  phase- 
to- voltage  conversion  gain  of  the  phase  detector  is  Jvj,  as  defined  in  Equation  37, 
and  the  transfer  function  of  the  loop  filter  is  F{s),  then  the  loop  phase  transfer 
function  can  be  represented  as 

0„co(^)  _  KqlQFjs) 

0.„(s)  s  +  KKdF{sy 

While  simpler  phase-locked  loops  could  be  used  for  clock  receiver  designs,  per¬ 
haps  the  best  is  a  second  order  loop  with  a  lag-lead  low  pass  filter,  shown  in 
Figure  21.  This  design  allows  a  wide  frequency  range  for  operation,  has  indepen¬ 
dent  noise  bandwidth  and  loop  gain,  is  eeisily  implemented  with  reasonable  layout 
requirements,  and  has  a  small  steady-state  phase  error. 
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Ry 


Figure  21;  Lag-lead  low  pass  loop  filter. 


The  tranfer  function  of  the  lag-lead  low  pass  loop  filter  is  given  as 


F(s) 


ST2  +  1 
4-  1 


(42) 


where  ti  =  (Ri  ■i-R2)C  and  r2  =  RyC.  The  transfer  function  for  the  phase-locked 
loop  with  this  type  of  filter  can  be  represented  as  [37] 


H(s) 


_ KoK4(st2  -H  l)/ri _ 

-t-  s(l  4-  KoKdT2/Ti)  4-  KaKd/n  ' 


(43) 


Typically  the  transfer  function  is  represented  in  a  canonical  form  for  second 
order  circuits,  or 

^  3[2Ca>n  -  ujl/iKM]  + 

4-  2Qu>nS  4-  u;2 

where  w„  is  the  natural  frequency  of  the  second  order  system,  and  is  a  damping 
factor.  By  matching  the  coefficients  of  Equation  43  and  Equation  44,  and  C 
can  be  expressed  in  terms  of  /vj,  Ko,  D  and  T2,  yielding 


(46) 


Most  of  the  interesting  operating  parameters  of  the  phase-locked  loop  can  be 


expressed  in  terms  of  Un,  C»  and  the  conversion  gains. 
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For  the  noise  analysis  that  is  presented  in  Section  4.3,  the  loop  bandwidth, 
Bi,,  is  of  particular  interest.  The  noise  bandwidth  is  defined  as  [38] 

Bc=  r  lll(j27rf)l^df  (47) 

JO 


where  H(j2Trf)  is  the  one-sided  transfer  function  of  the  loop  to  be  consistent  with 
Gardner’s  phase-locked  loop  terminology  [38].  For  a  second  order  loop  with  a 
lag-lead  pztssive  low  pass  filter,  is 


Bl  = 


+  KoKa-Ti) 


Hz. 


(48) 


The  noise  bandwidth  has  dimensions  of  hertz  rather  than  radians/second  to  be 
compatible  with  Gardner’s  terminology  in  which  noise  power  spectral  density  is 
expressed  in  units  of  watts/hertz. 


In  addition  to  the  noise  bandwidth  of  the  loop,  the  phase  acquisition  frequency 
ranges  that  have  been  described  be  determined[39].  For  the  second  order  loop  with 
a  passive  lag-lead  filter,  the  hold  range  is 


Auh  =  KJUF{0)  =  KJQ.  (49) 

This  is  the  frequency  parameter  of  primary  concern  for  clock  receiver  designs; 
however,  for  means  of  comparison,  the  pull-in  range  for  the  same  system  with 
sinusoidal  phase  detector  characteristics  when  KgKd  ^  ^ni  can  be  expressed  as 

Acjp  «  —\J(^u}nKoNd  —  (50) 

The  lock-in  range  for  second  order  loop  with  a  sinusoidal  phase  detector  can  be 
approximated  as  the  natural  frequency  of  the  loop,  or 


Aul  ^  Wn-  (51) 

Since  the  phase  detector  characteristic  for  the  optical  clock  receiver  is  triangular, 
rather  than  sinusoidal,  the  lock-in  and  pull-in  ranges  are  extended  by  a  factor  of 
7r/2  [40]. 
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Parameter 

Value 

Parameter 

Value 

Ri 

32.3  kQ 

C 

0.5  pF 

R2 

31.4  kfl 

n 

31.4  ns 

Req 

31.4  kO 

T2 

15.3  ns 

Table  3:  Typical  phase-locked  loop  filter  and  phase  detector  parameters. 


Two  time  parameters  predicting  the  acquision  process  are  Tp,  the  time  to  pull 
the  phase  to  within  27r  of  the  input  signal,  and  T[,,  the  time  for  the  phase  to  lock, 
once  the  frequency  is  pulled  into  the  lock  range.  As  might  be  expected,  the  puU-in 
time  is  generally  much  longer  than  the  lock-in  time.  For  second  order  loops  [39] 


Tp 


(52) 


where  Awq  is  the  difference  between  the  input  frequency  and  the  center  frequency 
of  the  VCO.  The  lock-in  time  for  second  order  systems  can  be  approximated  as 


Tl  «  l/w„. 


(53) 


In  order  to  estimate  the  operating  parameters  of  a  typical  implementation  of 
the  voltage  controlled  ring  oscillator  with  minimum  size  inverter  gates  and  2  mi¬ 
cron  channel  lengths,  SPICE  simulations  have  been  performed  to  determine  the 
voltage-to-frequency  conversion  gain.  In  this  study,  the  control  voltage  for  the 
pass  transistors  was  varied  and  the  simulated  VCO  frequency  was  determined. 
The  resulting  VCO  frequency  characteristic  is  shown  in  Figure  22.  For  this  inves¬ 
tigation,  NMOS  pass  transistors  of  minimum  size  were  used,  allowing  the  highest 
operating  frequencies  for  the  VCO.  The  result  was  linearized  to  obtain  a  conver¬ 
sion  gain  of  Ko  —  27r(128)xl0®  radians/V-s. 

Parameters  for  the  phase  detector  and  loop  low  pass  filter  that  might  be  con¬ 
sidered  typical  for  a  2-micron  implementation  of  this  phase-locked  loop  are  shown 
in  Table  3.  These  estimates  reflect  the  values  that  might  be  implemented  with 
reasonable  layout  area  requirements. 

Given  the  filter  parameters  shown  in  Table  3  and  assuming  a  photocurrent  of  10 
fiA,  an  estimate  of  typical  operating  parameters  for  the  phase-locked  loop  can  be 


Output  frequency,  (MHz) 
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Figure  22:  VCO  characteristic  curve  for  implementation  with  2-micron  channel 
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Parameter 

Symbol 

Value 

VCO  conversion  gain 

Ko 

804x10®  r-’dians/s-V 

Phase  detector  conversion  gain 

IQ 

0.1  V/radian 

Loop  natural  frequency 

50.6x10®  radians/s 

Loop  damping  factor 

C 

0.7 

Noise  bandwidth 

14.4  MHz 

Hold  range 

Auff 

80.4x10®  radians/s  (12.8  MHz) 

Pull-in  range 

^(jjp 

80.4x10®  radians/s  (12.8  MHz) 

Lock-in  range 

Aujl 

79.5x10®  radians/s  (12.7  MHz) 

Pull-in  time 

Tp 

35.6  ns 

Lock-in  time 

Tl 

19.8  ns 

Table  4:  Typical  phase-locked  loop  parameter  for  a  2-micron  CMOS  implementa¬ 
tion. 

determined.  These  are  shown  in  Table  4.  The  pheise  detector  conversion  gain,  Kj, 
given  the  expected  photocurrent  and  equivalent  input  resistance,  is  0.1  V/radian. 
As  indicated  ii..  the  table,  the  natural  frequency  of  the  second  order  loop,  u;„,  is 
50.6x10®  radians/s,  and  the  damping  factor,  is  0.7.  In  this  design,  has  been 
optimized  by  adjusting  the  pole  and  zero  of  the  loop  filter  to  allow  minimum  decay 
time  for  the  transient  response  of  the  loop.  The  noise  bandwidth  for  the  phase- 
locked  loop,  from  Equation  48,  is  14.4  MHz.  Using  the  estimat-«  i  values  of  u;,,  and 
C,  the  estimated  hold  range  for  the  phase-locked  loop  is  80. 4x  10®  radians/s,  or  12.7 
MHz.  Since  the  relationship  KoK^  is  not  true  for  this  example,  Equation  50 

is  not  valid,  and  the  pull-in  range  is  equal  to  the  hold  range.  While  the  lock-in 
range  is  not  a  critical  parameter  for  clock  syncronization,  assuming  a  triangular 
phase  detector  characteristic,  it  can  be  estimated  as  79.5x10®  radians/s,  or  12.7 
MHz.  The  pull-in  time  and  lock-in  time  can  be  estimated  as  35.6  ns  and  19.8 
ns,  respectively,  yielding  a  total  expected  acquisition  time  of  55.4  ns  for  an  input 
frequency  of  maximum  separation  from  the  free-running  oscillator  frequency. 

The  phase- locked  loop  receiver  that  has  been  given  in  this  example  demon¬ 
strates  two  improvements  over  the  transimpcdance  receiver  design  presented  .i 
Chapter  3.  The  operating  frequency  of  the  phase-locked  loop  is  much  higher  than 
the  maximum  available  with  the  traasimpedance  receiver  with  similar  transistor 
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channel  lengths.  The  phase-locked  loop  can  operate  in  the  frequency  range  of  100 
-  200  MHz,  while  the  transirnpedance  receiver  is  limited  to  ahonf  50  ^^Hz.  In 
addition,  the  digital  natiire  of  the  voltage  controlled  ring  oscillator  in  the  phase- 
locked  loop  allows  implementation  with  minimum  size  transistors.  The  largest 
devices  in  the  phase-locked  loop  axe  the  capacitors  used  for  the  low  pass  filters; 
these  can  be  reahzed  using  the  gate  capacitance  of  transistors,  requiring  1/4  of  the 
area  required  for  a  single  input  inverter  in  the  transirnpedance  design.  Overall, 
the  phase-locked  loop  can  be  designed  in  a  layout  area  much  smaller  than  the 
transirnpedance  receiver. 


4.2  Static  Response  Skew 


Just  as  has  been  shown  for  the  transirnpedance  clock  receiver  presented  in  Chap¬ 
ter  3,  identically  drawn  circuit  elements  for  the  phase-locked  loop  receiver  cannot 
be  fabricated  without  small  variations  in  dimensional  and  threshold  parameters. 
These  parameter  variations  between  copies  of  phase-locked  loops  implemented  on 
a  given  VLSI  chip  can  cause  a  variation  in  the  steady-state  phase  error  from  one 
receiver  to  another.  This  difference  in  steady-state  phase  error  represents  the  clock 
skew  in  the  phase-locked  loop  receiver  approach  for  optical  clock  distribution,  and 
is  discussed  in  this  section. 


When  a  p.  eise-locked  loop  reaches  a  locked  state,  a  steady-state  difference 
between  the  input  and  oscillator  signal  phases  is  maintained.  This  steady-state 
phsise  error  is  determined  by  using  the  final  value  theorem  of  Laplace  transforms 
[41].  Ihe  Laplace  transform  of  the  phase  error,  0e(-s),  can  be  represented  jis 


0.(s)  = 


^0.(^) 

s  +  KoK,F{sy 


(54) 


From  the  final  value  theorem,  the  steady-state  time  response  can  be  determined 
by  the  relationship 

lim  y(t)  =  limsK(s). 

»oo  j— *0 


(55) 
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Applying  the  final  value  theorem  to  the  phzise  error,  the  steady-state  phase  error, 
is  given  as 

=  1^0  s  +  idirisy 

To  model  the  steady  state  error  when  a  single  frequency  input  is  applied  to  an 
unlocked  loop,  the  input  phase  is  represented  as  a  step  in  frequency,  Auig/s^,  where 
Awo  is  the  frequency  difference  between  the  input  signal  and  the  free-running  local 
oscillator.  Substituting  this  value  of  0,(s)  in  Equation  56.  the  steady  state  phase 
error  is  represented  zis 

=  lirn - ^ ^ - .  (57) 

s^o  s KJQFis)  KoKdF{0)  ^  ’ 

For  phase- locked  loops  with  passive  loop  filters,  F(0)  =  1,  and  Equation  57  be¬ 


comes 


d,.,  = 


KoKd 


Variations  in  the  device  dimensions  and  thresholds  due  to  the  fabrication  pro¬ 
cess  cause  the  conversion  gains  of  the  VCO  and  the  phase  detector  to  vary  from 
one  receiver  to  another  on  a  chip,  and  consequently  cause  differences  in  steady- 
state  phase  error.  The  maximum  difference  in  steady-state  phase  error  due  to 
device  parameter  variations  can  be  represented  as 


A^jj  —  ^mai  ^min  —  AoJo 


(KoIuUn  (KoK,)r 


where  Omax  and  Ornin  represent  the  steady-state  phase  errors  for  conics  of  the  phase- 
locked  loop  with  the  maximum  and  minimum  cases.  For  a  we  ‘  -r'.-rj  bn-'te, 
AcJo  represents  the  maximum  frequency  difference  between  tV  icnl  •  n  ^ 
and  the  free-running  oscillator,  or  Ate//. 


For  comparison  of  the  static  synchronization  uncertainty  for  the  pha.ce-Ic  "l  cd 
loop  receiver  that  has  been  presented  in  Section  4.1,  SPICE  simulations  have  been 
performed  to  determine  the  range  of  conversion  gains  expected  for  a  phase-locked 
loop  with  2  micron  channel  length  transistors.  The  transistor  channel  dimeiLsions 
were  varied  ±0.1/im,  and  the  threshold  voltages  were  varied  ±1  mV,  as  was  done 


4.3.  SIGNAL  JITTER  ANALYSIS 


55 


for  the  transimpedance  receiver  analysis  in  Section  3.2.  The  minimum  combi¬ 
nation  of  conversion  gains,  {KoKd)r\in,  ’>vas  computed  as  78x10®;  the  maximum 
value,  {KoKd)maxi  was  82x10®.  Using  these  values  in  Equation  59,  along  with 
the  estimate  of  the  hold  range,  Aljh  =80.4x10®,  a  typical  value  of  the  steady- 
state  phase  error  is  0.05  radians.  For  an  operating  frequency  of  1 50  MHz,  this 
corresponds  to  a  timing  skew  of  53  ps.  When  compared  to  the  transimpedance 
receiver  skew  of  3  ns,  the  phase-locked  loop  static  timing  uncertainty  represents 
an  improvement  of  almost  2  orders  of  magnitude. 


4.3  Signal  Jitter  Analysis 

The  phase-locked  loop  receiver  approach  has  been  shown  in  Section  4.1  to  oper¬ 
ate  at  a  higher  frequency  than  is  possible  with  a  transimpedance  receiver,  with 
static  receiver  skew  that  is  orders  of  magnitude  less  than  that  obtained  with  the 
transimpedance  approach,  as  shown  in  Section  4.2.  In  this  section,  a  circuit  noise 
analysis  is  presented,  followed  by  a  timing  jitter  analysis  for  the  phase-locked  loop 
with  a  comparison  of  clock  jitter  for  the  two  receiver  approaches. 

The  primary  sources  of  noise  in  the  phase-locked  loop  receiver  are  the  pho¬ 
todiode  and  the  biasing  resistors.  The  thermal  noise  of  the  pass  transistors  and 
the  inverter  stages  of  the  ring  oscillator  output  signal  is  negligible  due  to  the 
conversion  gains  of  the  VCO  and  the  phase  detector. 

Since  the  photodiode  is  alternately  forward  ar  ’  erse  biased  by  the  output 
signal  of  the  ring  oscillator,  the  noise  characteriau..  of  the  diode  is  not  stationary'. 
By  separating  the  analysis  into  four  czises  of  biasing  and  optical  input  signal 
levels,  however,  the  noise  characteristic  for  the  photodiode  can  be  approximated  as 
stationary,  and  linear  analysis  can  be  performed.  The  four  cases  are  summarized 
in  Table  5.  In  general,  a  given  cycle  of  the  VCO  output  signal  will  cause  the 
photodiode  characteristic  to  have  time  segments  representing  each  case,  depending 
on  the  overlap  between  the  optical  input  signal  and  the  electronic  VCO  output 
signal.  Since  the  noise  contributions  from  each  case  are  different,  the  total  noise 
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Case 

Description 

Case  I 

VCO  output  LOW,  optical  input  OFF 

Ciise  II 

VCO  output  LOW,  optical  input  ON 

Case  III 

VCO  output  HIGH,  optical  input  OFF 

Case  IV 

VCO  output  HIGH,  optical  input  ON 

Table  5:  Photodiode/phase  detector  states  for  noise  analysis. 


Figure  23:  Noise  model  for  phase-locked  loop  photodiode. 

energy  in  a  given  cycle  is  a  function  of  the  overlap  of  the  two  signals.  In  the 
analysis  presented  here,  the  total  noise  energy  for  a  given  cycle  is  given  first  in 
terms  of  the  overlap  between  signals,  then  is  shown  to  be  a  weakly  varying  function 
of  this  overlap,  so  that  the  noise  can  be  approximated  as  stationary  Since  1/f 
noise  is  not  considered  for  this  analysis  due  to  the  high  operating  frequency  of  the 
diode,  the  noise  will  also  be  assumed  to  have  imiform  spectral  density. 

The  noise  in  the  photodiode  circuit  can  be  modeled  eis  a  current  source  in 
parallel  with  the  diode  junction  resistcince  and  capacitance,  cis  shown  in  Figure  23. 
In  this  model,  is  given  by  [42] 

=  4kTB/R,,  -  2qBh  (60) 

where  72^,  is  the  parallel  combination  of  r^,  i2p„  and  R^^,  and  B  is  the  bandwidth 
of  the  circuit. 

For  cases  /  and  //,  the  photodiode  is  forward  biased.  Since  the  photocurrent 
in  II  is  much  smaller  than  the  forward  current  in  the  diode,  I^  for  both  cases  is 
approximately  the  forward  current  in  the  diode.  If.  Because  the  junction  resis¬ 
tance  for  the  forward  biased  diode  is  small,  the  equivalent  resistance,  R^q  can  be 
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approximated  as  rj,  where 


Substituting  Equation  61  into  Equation  60,  the  total  noise  current  for  the  forward 
biased  cases  is 

JlJ  =  4kTB/R,,  -  2kTB/R,,  =  2kTB/R,,.  (62) 


Assuming  a  square  wave  VCO  output  signal  of  approximately  50  per  cent 
duty-cycle,  the  duration  of  the  forward  biased  cases  for  a  given  cycle  is 

tfn,d  =  l/(2/vco)  (63) 

where  f^co  is  the  frequency  of  the  VCO  output  signal.  Combining  Equation  62 
and  Equation  63,  the  total  noise  energy  in  a  given  cycle  due  to  the  forward  biased 
cases  is  represented  as 

Enjiud  —  nr  • 


The  noise  current  for  the  reverse  biased  cases  {III  and  JV),  can  also  be  rep¬ 
resented  in  terms  of  Equation  60.  For  these  cases,  the  diode  current  is  given 


Id.rev  —  ip/i  "b  Ibg  “b  Idk  (66) 

where  Rg  and  I^k  are  the  photocurrent  due  to  background  light  and  the  dark 
leakage  current  of  the  diode,  respectively,  and 


*p/»  — 


0  for  Case  III 
Iph  for  Case  IV 


Combining  Equation  60  and  Equation  65,  the  total  mean  square  noise  current 
for  the  reverse  biased  cases  is 

=  4kTB/R,g  -  2qB{ipH  +  hg  +  Idk).  (67) 


Since  the  junction  resistance  for  the  reverse  biased  diode  is  very  large,  the  equiv¬ 
alent  resistance  for  Equation  67  is  the  parallel  combination  of  Rp^  and  Rpj. 


58 


CHAPTER  4.  PHASE-LOCKED  LOOP  RECEIVER 


For  photocurrent  levels  of  10  fiA  or  less,  the  shot  noise  contribution  of  Equa¬ 
tion  67  is  several  orders  of  magnitude  less  than  the  thermal  noise  contribution, 
and  the  total  equivalent  mean  square  noise  current  for  cases  III  and  IV  can  be 
represented  as 

=  4kTB/R,,.  (68) 

The  duration  of  the  reverse  biased  cases  for  a  given  cycle  of  the  VCO  output 
signal  is 

trev  =  l/(2/„co)  (69) 

The  total  noise  energy  for  a  given  cycle  of  the  VCO  output  signal  due  to  the 
reverse  biased  cases  can  therefore  be  given  as 

4kTB 


Hn,rev  — 


2/„, 


(70) 


The  total  equivalent  input  noise  power  for  phaise-locked  loop  is  the  sum  of  the 
noise  energy  contributions  from  the  forward  and  reverse  biased  cases  divided  by 
the  period  of  the  cycle,  or 


P.n  = 


kTB/f,„  +  2kTBIf, 


l/fv 


=  3kTB. 


(71) 


The  noise  power  spectral  density  is 


Wi  =  dkT 


(72) 


Given  the  equivalent  input  noise  power  spectral  density  from  Eq^cLtion  72  and 
the  noise  bandwidth  of  the  phase- locked  loop  from  Section  4.1,  the  mean  square 
phaise  noise  on  the  output  of  the  VCO  can  be  represented  as  [38] 


where  P,  is  the  photocurrent  input  signal  power,  or 


(73) 


2 


(74) 
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where  RjmWRpd  represents  the  equivalent  input  resistance  and  is  the  parallel  com¬ 
bination  of  the  pull-up  and  pull-down  resistors  in  the  voltage  divider. 

For  the  example  set  of  device  parameters  presented  in  Section  4.1  including 
an  equivalent  input  resistance  of  31.4  kQ,  a  noise  bandwidth  of  14.4  MHz,  and  an 
input  photocurrent  of  10  fxA,  the  mean  square  phase  noise  is  equal  to  1.14x10“’^ 
radians^  and  the  rms  phase  noise  is  3.37 x  10“'*  radians.  For  an  operating  frequency 
of  150  MHz,  this  corresponds  to  an  rms  timing  jitter  of  0.36  ps.  As  is  the  case  for 
the  transimpedance  receiver  presented  in  Chapter  3,  the  timing  uncertainty  due 
to  noise  at  each  receiver  is  negligible  when  compared  to  the  static  response  skew, 
but  the  overall  noise  performance  of  the  phase-locked  loop  is  orders  of  magnitude 
better  than  for  the  transimpedance  receiver. 


4.4  Optical  Power  Comparison 

The  difference  between  the  timing  uncertainy  performance  of  the  transimpedance 
receiver  and  the  phase-locked  loop  receiver  can  be  viewed  in  terms  of  the  inci¬ 
dent  optical  power  required  to  achieve  a  given  level  of  performance.  Since  the 
phase-locked  loop  receiver  requires  only  that  the  optical  signal  synchronize  ex¬ 
isting  electronic  local  oscillilators,  the  incident  optical  power  for  this  receiver  is 
much  less  than  that  required  by  the  transimpedance  receiver  to  achieve  the  same 
quality  of  timing  performance.  An  analysis  of  the  relationship  between  incident 
optical  power  and  steady  state  response  skew  for  both  receivers  is  presented  in 
this  section,  followed  by  a  similar  analysis  for  signal  jitter. 

The  response  skew  for  the  transimpedance  receiver  is  not  related  to  the  input 
signal  strength,  but  is  entirely  dependent  on  the  differences  in  transistor  channel 
dimensions  and  threshold  voltages  between  copies  of  the  receiver  on  a  given  chip. 
For  this  reason,  the  response  skew  of  the  transimpedance  receiver  is  constant  with 
respect  to  input  optical  power. 

The  relationship  between  incident  optical  power  and  stecidy-state  response 
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skew  for  the  phase-locked  loop  receiver  is  a  consequence  of  the  input  signal  level 
dependency  of  the  phase  detector  gain.  Using  Kd  as  defined  by  Equation  37  in 
Section  4.1,  with  the  receiver  response  skew  described  by  Equation  59  in  Sec¬ 
tion  4.2,  the  relationship  between  the  peak  photocurrent  and  phase-locked  loop 
response  skew  for  an  operating  frequency,  /,  is 


AuJoTT 


The  photocurrent  is  given  by  [43] 


Inh  — 


P uptVext^ 


where  Popt  is  the  incident  optical  power,  riext  is  the  external  quantum  efficiency,  q 
is  the  charge  on  a  electron,  and  hi/  is  the  photon  energy.  Substituting  Equation  76 
for  Iph  in  Equation  75,  the  relationship  between  the  response  skew  and  incident 
optical  power  is  given  by 


Au/oirhu 


fJcTt^Popt  L ( )m«n  ( ^ o^cq ) r 


The  external  quantum  efficiency  includes  reffection  loss  due  to  the  difference 
in  dielectric  constants  between  silicon  and  air  and  is  given  as 


^ext  —  Vintil  R') 


where  rjint  is  the  internal  quantum  efficiency  and  R  is  the  power  reflection  co¬ 
efficient  at  the  silicon  surface.  For  photodetection  in  silicon  with  illumination 
at  a  wavelength  of  633  nm  and  a  depletion  layer  width  of  2  microns,  the  in¬ 
ternal  quantum  efficiency  is  about  0.5  [44].  Assuming  no  antireflection  coating, 
the  reflection  coefficient  for  the  silicon  surface  is  about  0.3.  The  external  quan¬ 
tum  efficiency,  is  therefore  approximately  0.35  for  the  photodiodes  used  in 
the  receiver  design  examples  presented  in  Chapter  3  and  this  chapter,  hu/q  for 
633  nm  illumination  is  approximately  2  V"’.  Based  on  the  example  presented 
in  Section  4.2,  {KoReq)min  =  2.52  x  10*^  s"^  and  (7Co/?e?)mox  =  2.57  x  10*^  s~'. 
Using  these  values  in  Equation  77,  assuming  the  maximum  input  frequency  step. 
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Aujo  =  80.4  X  10®  radians/s  and  an  operating  frequency  of  150  MHz,  the  response 
skew  for  the  example  phase-locked  loop  receiver  is 


iak,ptl 


1.18  X  10"^® 


Opt 


(s) 


(79) 


where  Popt  is  expressed  in  watts. 


Figure  24  is  a  comparison  of  the  response  skew  for  the  transimpedance  and 
phase-locked  loop  receivers  as  a  function  of  incident  optical  power.  The  response 
skew  for  the  transimpedance  amplifier  is  constant  with  respect  to  optical  signal 
power,  and  is  shown  as  the  horizontal  dashed  line.  The  solid  line  represents  the 
phase-locked  loop  response  skew  as  given  by  Equation  79.  As  shown  in  the  figure, 
the  phase-locked  loop  can  operate  with  much  lower  incident  optical  power  to 
achieve  a  maximum  skew  less  than  the  transimpedance  receiver  value. 


In  addition  to  comparing  the  response  skew  for  the  two  receiver  examples,  the 
signal  jitter  due  to  noise  can  be  compared.  For  each  receiver,  the  rms  signal  jitter 
is  a  fimction  of  the  incident  optical  power.  The  relationship  of  signal  jitter  to 
incident  optical  power  for  the  transimpedance  receiver  is  presented  first,  followed 
by  a  similar  analysis  for  the  phase-locked  loop  receiver. 


From  Equation  11,  the  transimpedance  receiver  rms  signal  jitter  can  be  rep¬ 
resented  as 


tti.r,n.  =  17^  (80) 

where  is  the  mean  square  noise  current,  /  is  the  operating  frequency,  and  Ip^^  is 
the  photocurrent.  Substituting  Equation  76  for  /p/,  in  Equation  SO,  the  rms  signal 
jitter  as  a  function  of  optical  input  power  becomes 


- •  (81) 

2tVJ  PoptVextQ 

Assuming  an  operating  frequency  of  50  MHz  (the  maximiun  frequency  for  a  2- 
micron  implementation),  =  2.86x10“*®  A^,  Tjext  =  0.35  and  hu/q  =  2  V“*,  the 
rms  signal  jitter  for  a  2-micron  implementation  of  the  transimpedance  receiver  is 

7.17  X  10“*® 


t 


ti.rmj 


opt 


(s)  . 


(82) 
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Peak  incident  optical  power  (W) 


Figure  24;  Response  skew  for  trans impedance  (dashed)  and  pha^e- locked  loop 
(solid)  receiver  examples  vs.  peak  incident  optical  power. 


Using  Equation  73  for  the  mean  square  phase  noise  for  the  phase-locked  loop 
receiver,  Equation  48  for  the  noise  bandwidth,  and  Equation  76  for  the  photocur¬ 
rent,  the  rms  signal  jitter  for  the  phase-locked  loop  receiver  can  be  represented 

ELS 


_  J_ 

tpll.rm,  - 


\ 

^  kuiTTi  ~  ^2  ) 

O  1  opt<f  ] 

(  irhu 

_ 1_  1  \ 

L“\  hu  ) 

\  I\o^^qncxt  P op* 9 "^2  ) 

1/2 


(83) 


where  TU,  is  the  equivalent  noise  power  spectral  density,  and  ri  and  T2  are  the 
time  constants  of  the  lead-lag  low  pass  filter  cis  defined  in  Section  4.1. 


Using  the  e.xample  parameters  given  in  Section  4.3  for  the  noise  power  spectral 
density  and  the  phase-locked  loop  e.xample  parameters  from  Section  4.1,  the  rms 
signal  jitter  for  a  2-micron  implementation  of  the  phase-locked  loop  receiver  is 


»  2,69  X  10-”(6.S5  X  +  6.53  x  10')'^’ 


(s).  (84) 
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Peak  incident  optical  power  (W) 

Figure  25:  Rms  signal  jitter  for  transimpedance  (dashed)  and  phase-locked  loop 
(solid)  receiver  examples  vs.  peak  incident  optical  power. 

The  rms  signal  jitter  plots  for  the  two  receiver  examples  are  shown  in  Figure  25 
as  a  function  of  incident  optical  power.  In  the  figure,  the  dashed  line  represents 
the  transimpedance  amplifier  example,  while  the  solid  line  represents  the  phase- 
locked  loop  receiver.  Again,  the  phase-locked  loop  receiver  can  achieve  a  given  rms 
signal  jitter  specification  with  a  much  lower  incident  optical  power  requirement. 

A  comparison  of  optical  power  costs,  and  operating  parameters  for  the  two 
receiver  examples  that  have  been  presented  clearly  indicates  that  a  phase-locked 
loop  receiver  is  much  more  attractive  for  use  in  an  optical  clock  distribution  system 
for  a  VLSI  chip.  The  phase-locked  loop  can  be  implemented  with  considerably  less 
layout  area  and  will  operate  at  frequencies  that  represent  a  substantial  improve¬ 
ment  over  wired  clock  distribution  systems,  where  the  transimpedance  amplifier 
can  offer  at  best  marginal  frequency  improvement  over  wired  distribution  systems 
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while  requiring  massive  use  of  chip  surface  area  for  implementation.  In  addi¬ 
tion,  the  fabrication-related  response  skew  between  phase-locked  loop  receivers 
on  a  given  chip  is  orders  of  magnitude  less  than  the  similar  skew  between  trans¬ 
impedance  receivers  for  identical  incident  optical  power  levels.  While  the  clock 
signal  jitter  for  each  type  of  receiver  is  small  compared  to  the  response  skew,  the 
phase-locked  loop  receiver  also  offers  orders  of  magnitude  improvement  for  this  pa- 
reuneter  as  well.  Timing  uncertainty  specifications  can  be  maintainted  with  much 
lower  optical  power  requirements  for  the  phase-locked  loop  receiver  example. 


Chapter  5 

Test  Chip  and  Laboratory 
Measurements 


A  test  chip  has  been  designed  to  measure  various  operating  parameters  of  a  simple 
prototype  transimpedance  receiver  and  driver  circuit  for  an  optical  clock  distribu¬ 
tion  system.  This  chip  contains  circuits  designed  for  meastirement  of  the  operating 
frequency  and  response  skew  for  simple  transimpedance  optical  receivers  as  well  a 
set  of  circuits  for  measurement  of  leakage  currents  due  to  photogenerated  carriers. 

Section  5.1  is  a  general  description  of  the  test  chip.  Descriptions  of  the  cir¬ 
cuits  on  the  chip  and  procedures  for  me^lsurement  of  the  operating  frequency  of 
a  transimpedance  receiver  circuit  are  presented  with  laboratory  results  in  Sec¬ 
tion  5.2.  The  circuits,  procedures  and  data  for  the  response  skew  and  leakage 
current  tests  are  presented  in  Section  5.3  and  Section  5.4,  respectively. 
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5.1  General  Chip  Description 


Figure  26  is  a  photograph  of  the  CMOS  optical  clock  receiver  test  chip.  This 
chip  contains  three  separate  sets  of  test  circuits.  The  18  vertical  strips  of  devices 
on  the  chip  are  individual  optical  receivers.  The  nine  receivers  in  the  upper  half 
have  the  two-phase  clock  driver  circuits  included,  while  the  circuits  in  the  lower 
half  do  not.  The  receivers  in  the  lower  half  are  included  to  help  align  the  optical 
input  beam  to  the  photodiode  window  for  the  leakage  current  test;  the  outputs  of 
these  receivers  do  not  provide  information  regarding  the  data  being  measured  for 
the  test.  The  photodiodes,  shown  circled  in  the  photograph,  are  near  the  lower 
parts  of  the  receivers  and  are  20  x  20  microns  in  size.  The  area  surrounding  the 
photodiode  is  masked  by  an  aluminum  sheet  100  x  100  microns  with  a  window  to 
allow  light  to  penetrate  to  the  photodiode  while  protecting  the  surrounding  silicon 
from  incidental  illumination.  This  mask  is  part  of  the  top  metal  interconnect  layer, 
and  is  electrically  connected  to  the  ground  bus. 

The  leftmost  vertical  string  of  devices  in  the  upper  half  of  the  layout  com¬ 
prises  the  test  circuit  for  measuring  the  maximum  clock  rate.  The  eight  similar 
vertical  strips  to  the  right  of  this  circuit  are  receivers  used  in  the  skew  meastire- 
ment  test,  while  the  nine  vertical  blocks  in  the  lower  half  of  the  layout  contain 
the  leakage  current  measxirement  test.  This  chip  has  been  fabricated  with  the 
DARPA-sponsored  MOSIS  3-micron  CMOS  process.  The  die  size  is  6900  x  6800 
microns,  using  a  standard  40  pin  package.  While  a  1.5-  or  2-micron  fabrication 
process  could  have  produced  device  sizes  with  response  times  and  parameter  tol¬ 
erances  more  indicative  of  the  state-of-the-art  in  silicon  CMOS  technology,  the 
3-micron  process  was  chosen  because  it  represented  an  established  technology 
with  reasonably  good  reliability.  At  the  time  of  fabrication,  the  MOSIS  2-micron 
fabrication  process  did  not  have  an  established  delivery  and  reliability  reputation. 


The  MOSIS  facility  returned  13  copies  of  the  test  chip,  each  with  limited 
performance  reliability.  The  analog  nature  of  the  transimpedance  amplifier  makes 
this  design  very  sensitive  to  deviations  of  the  transistor  threshold  voltages  greater 
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Figure  26:  Optical  clock  distribution  test  chip. 
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Figure  27:  Transimpedance  optical  receiver  and  two-phase  clock  circuit  for  test 
chip. 

than  100  mV  from  the  nominad  design  values.  Since  the  MOSIS  CMOS  process 
is  primarily  a  digital  circuit  fabrication  service,  the  threshold  voltage  range  is 
larger  than  is  generally  acceptable  for  analog  circuit  implementation,  and  many 
of  the  receiver  circuits  on  the  test  chip  did  not  work  as  a  result.  The  location  of 
the  receivers  which  failed  varied  from  chip  to  chip,  however,  and  enough  working 
receivers  were  available  over  the  13  copies  of  the  chip  to  perform  measurements 
of  the  parameters  imder  test. 

The  transimpedance  receiver  that  has  been  presented  in  Chapter  3  is  a  princi¬ 
pal  element  in  each  of  the  three  test  circuits  on  the  chip  presented  in  this  chapter. 
The  design  presented  in  Chapter  3  has  been  scaled  to  accomodate  a  3-micron 
fabrication  process.  A  circuit  diagram  of  the  transimpedance  receiver  is  shown  in 
Figure  27.  Devices  labeled  A„  are  analog  inverter  stages,  while  those  labeled 
and  N„  are  digital  logic  gates.  Table  6  contains  device  dimensions  for  the  elements 
of  Figure  27. 


5.2.  OPERATING  FREQUENCY  TEST 
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NAME 

NMOS(/im) 

PMOS(^m) 

NAME 

NMOS(Axm) 

PMOS(/xm) 

Ai 

2070 

720 

Bi 

24 

24 

A2 

2070 

720 

B2 

24 

24 

A3 

1035 

360 

B3 

63 

123 

A4 

552 

192 

B4 

63 

123 

As 

276 

96 

Bs 

246 

492 

Ae 

138 

48 

Be 

246 

492 

Ar 

69 

24 

Bj 

246 

492 

Be 

246 

492 

N2 

63 

123 

N2 

63 

123 

Table  6:  Transistor  sizes  for  amplifier  and  clock  driver  circuits. 


The  amplifier  circuit  in  Figure  27  has  a  resistive  feedback  path  to  allow  self¬ 
biasing  and  tracking  of  quiescent  operating  voltages  for  the  chain.  The  photodiode 
for  this  particular  chip  design  is  a  p+diffusion  into  the  n  substrate  fabricated  in 
the  same  processes  used  to  make  sources  and  drains  for  the  p-channel  transistors 
in  the  standard  MOSIS  3-micron  p-well  technology.  The  capacitive  loading  that 
is  anticipated  for  distribution  of  each  clock  phase  in  the  context  of  a  practiced 
VLSI  system  is  modeled  on  the  test  chip  by  driving  a  large  inverter  stage  with 
each  clock  output  signal. 


5.2  Operating  Frequency  Test 

The  output  pads  and  pin  connections  for  the  dual  inline  package  (DIP)  in  which 
the  test  chip  is  mounted  are  limited  to  frequencies  lower  than  around  12  MHz. 
In  order  to  measure  on-chip  clock  frequencies  higher  than  the  bandwidth  of  the 
output  pads  on  the  test  chip,  a  frequency  divider  circuit  has  been  included.  This 
circuit  allows  the  indirect  measurement  of  the  on-chip  clock  signal  frequency  by 
observing  a  test  signal  with  a  frequency  less  than  1  MHz. 

The  test  circuit  for  measuring  the  clock  rate  of  the  optical  receiver  is  composed 
of  an  optical  receiver  and  clock  driver  circuit  described  in  the  previous  section  and 
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Optical  receiver 
&.  clock  driver 


Divide-by-128  circuit 


JK 

JK 

JK 

JK 

JK 

JK 

JK 

>  Q 

— 

>  Q 

>  Q 

— 

>  Q 

— 

>  Q 

— 

>  Q 

>  Q 

— 

Clk 

Clk 

Clk 

Clk 

Clk 

Clk 

Clk 

h-2 

-^2 

^2 

^2 

-f-2 

^2 

-7-2 

Figure  28:  Frequency  divider  circuit  for  ofF-chip  measurement  of  output  frequency. 


Figure  29;  Optical  layout  for  frequency  measurement. 

a  divide-by-128  circuit.  A  block  diagram  of  the  circuit  is  shown  in  Figure  28.  The 
output  of  one  of  the  phases  of  the  clock  driver  circuit  is  used  as  a  clock  input 
to  the  divide-by-128  circuit.  The  divide-by-128  circuit  is  a  ripple  counter  circuit 
composed  of  successive  master-slave  JK  flipflops  that  have  been  configured  to 
toggle  the  output  state  for  each  clock  cycle.  Each  stage  divides  the  frequency  of 
its  clock  input  by  a  factor  of  2(45]. 

A  diagram  of  the  optical  layout  used  for  the  operating  frequency  measurement 
is  shown  in  Figure  29.  Since  only  a  single  spot  of  light  is  required  for  the  clock 
input  for  the  test,  a  hologram  is  not  required.  Instead,  a  single  laser  beam  is 
focused  to  a  spot  on  the  chip  surface.  In  the  test  configuration,  a  continuous  wave 
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Ser. 

No. 

Maximum  Input 
Frequency  (MHz) 

Ser. 

No. 

Maximum  Input 
Frequency  (Mhz) 

1 

16.8 

7 

18.0 

3 

14.7 

9 

13.2 

4 

18.5 

11 

16.0 

5 

12.2 

12 

14.1 

6 

15.2 

13 

12.0 

Table  7;  Maximum  operating  frequencies  for  sample  of  chips. 

(CW)  He-Ne  laser  beam  is  passed  through  an  acousto-optic  modulator  (AOM). 
Li  is  a  lens  used  to  focus  the  laser  beam  to  a  narrow  waist  for  interaction  with 
the  acousto-optic  crystal,  as  recommended  in  the  application  notes  for  the  device 
[46].  By  spatially  bandpass  filtering  the  1st  order  beam  out  of  the  acousto-optic 
modulator  by  means  of  a  diaphram,  the  AOM  can  be  used  as  a  shutter.  In  this 
manner,  the  light  beam  incident  on  the  chip  surface  is  modulated  on  and  off.  The 
spot  is  imaged  by  lens  onto  the  photodiode  on  the  test  chip.  The  frequency 
and  waveform  of  the  optical  signal  can  be  controlled  by  an  electronic  input  signal 
to  the  acousto-optic  modulator.  For  the  tests  in  this  section  and  Section  5.3,  the 
input  signal  wets  a  square  wave  of  50  per  cent  duty-cycle. 

The  operating  frequency  tests  were  performed  by  applying  an  optical  square 
wave  of  150  fiW  peak  power  to  the  photodiode  corresponding  to  the  transimped¬ 
ance  receiver  which  provides  the  input  signal  to  the  frequency  divider  circuit.  The 
frequency  of  the  optical  signal,  was  increased  until  the  low  frequency  output 
signal  of  the  JK  flip-flop  circuit  was  no  longer  a  TTL  level  signal  with  a  frequency 
equal  to  /,„/128.  The  measured  maximum  operating  frequencies  for  the  copies  of 
the  chip  tested  are  shown  in  Table  7.  The  mean  value  of  the  maximum  operating 
frequency  was  15.1  MHz,  while  the  variance  was  1.95  MHz^.  The  receivers  for  the 
operating  frequency  test  on  serial  numbers  2,  8  and  10  did  not  work.  The  total 
sample  included  measurements  on  10  chips. 

While  the  operating  frequency  measurements  are  based  on  a  sample  size  too 
small  to  have  statistical  significance,  the  results  give  an  indication  of  the  typical 
operating  frequency  that  is  possible  with  a  transimpedance  receiver  implemented 
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in  3-micron  CMOS  technology.  The  frequencies  measured  confirm  that  the  trans- 
impedance  receiver  approach  for  optical  clock  distribution  cannot  give  the  high 
frequency  performance  required  to  represent  an  improvement  over  existing  elec¬ 
tronic  systems  using  metal  wire  paths. 

The  measured  results  are  also  within  a  range  of  frequencies  that  might  be 
expected  for  a  3-micron,  implementation  scaled  from  the  2-micron  simulations 
performed  in  Chapter  3.  The  2-micron  simulations  showed  operation  up  to  50  - 
67  MHz  for  TTL-level  output  voltages.  Scaling  these  frequencies  down  by  a  factor 
of  1.5  to  reflect  the  larger  transistors  in  the  3-mi  ;ron  test  circuits,  the  expected 
maximum  operating  frequencies  are  30  -  45  MHz.  The  measured  values  are  lower 
than  simple  scaling  predicts,  but  are  within  a  factor  of  2  or  3. 


5.3  Receiver  Skew  Test 

The  receiver  skew  test  is  a  measiurement  of  the  difference  in  response  times  of 
identical  optical  receiver  and  clock  driver  circuits  that  are  fabricated  on  the  same 
chip.  Since  chip  fabrication  causes  variation  of  transistor  channel  dimensions  and 
threshold  voltages,  the  characteristics  of  identical  circuits  on  a  given  chip  vary 
slightly.  This  variation  is  generally  observed  to  be  more  severe  as  the  separation 
between  the  circuits  increeises.  In  this  test,  the  clock  driver  output  signals  of  eight 
identical  circuits  are  compared  in  pairs.  The  skew  between  receiver  outputs  is 
measured  by  generating  a  pulse  of  width  corresponding  to  the  separation  between 
the  rising  (or  falling)  pulse  edges  for  the  two  receiver  output  signals.  This  skew 
pulse  is  created  by  using  the  two  receiver  outputs  as  inputs  to  an  exclusive-OR 
circuit. 

A  diagram  of  the  circuit  used  to  determine  the  skew  between  the  output  signals 
of  the  clock  driver  pairs  is  shown  in  Figure  30.  The  four  logic  gates  at  the  top  of 
the  figure  perform  an  exclusive-OR  operation  on  the  output  signals  of  two  optical 
receiver  and  clock  driver  circuits.  Since  the  exclusive-OR  output  is  true  when 
one  and  only  one  input  is  true,  two  pulses  of  width  corresponding  to  the  time 
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Figure  30:  Receiver  skew  measurement  circuit. 

separation  of  the  clock  driver  outputs  are  produced;  the  first  pulse  is  formed  by 
the  separation  of  the  leading  edges  of  the  two  input  signals,  while  the  second  pulse 
is  formed  by  the  separation  of  the  trailing  edges  of  the  pulses.  The  exclusive-OR 
output  pulses  are  negative-true  logic  levels,  while  the  inputs  are  positive-true. 
The  load  capacitor,  Ci,  is  charged  by  a  PMOS  pass  transistor  when  the  output 
of  the  exclusive-OR  circuit  is  low.  If  the  pulse  is  short  enough,  the  capacitor  does 
not  charge  entirely,  and  the  final  voltage  across  the  capacitor  is  a  measure  of  the 
charging  time,  and  therefore  a  measure  of  the  skew  pulse  width  and  the  skew 
between  the  receiver  output  signals.  A  separate  path  is  provided  for  calibration 
of  the  system,  and  a  discharge  input  allows  residual  charge  on  the  load  capacitor 
to  be  removed  to  initialize  the  system  for  testing. 

This  system  allows  eeisy  calibration  of  chip  package  and  oscilloscope  probe 
parcisitics,  because  the  charging  characteristics  of  the  load  capacitor  include  the 
loading  effects  of  the  oscilloscope  and  probe.  Since  the  exact  RC  value  is  not  im¬ 
portant,  the  added  loading  of  the  off-chip  measurement  equipment  does  not  effect 
the  accuracy  of  the  measurement.  The  mcEisurement  is  calibrated  by  observing  the 
charging  characteristic  of  the  capacitor  (including  oscilloscope  and  probe)  when  a 
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step  input  is  applied  to  the  ceJibration  input  leaid.  The  measured  capacitor  voltage 
during  the  test  is  compared  to  the  point  on  the  calibration  charging  character¬ 
istic,  and  the  corresponding  time  to  charge  the  load  capacitance  to  that  voltage 
is  recorded.  This  charging  time  corresponds  to  the  width  of  the  skew  pulse,  and 
therefore  to  the  skew  between  the  receiver  output  signals.  The  resolution  of  the 
exclusive-OR  gate  is  about  four  times  the  propagation  delay  for  a  typical  logic 
gate. 

The  optical  layout  for  the  skew  test  is  shown  in  Figure  31.  This  layout  is 
similar  to  that  used  for  the  operating  frequency  test  shown  in  Figure  29  in  that 
an  zw:ousto-optic  modulator  is  used  as  a  shutter  to  modulate  the  CW  laser  input. 
The  first  order  output  of  the  AOM  is  first  collimated  by  lens  Li,  then  split  into 
two  beams.  One  beam  is  extended  in  length  by  means  of  mirrors,  and  both  beams 
are  focused  onto  the  chip  surface  by  lenses  L2  and  L3.  Each  spot  is  focvised  onto 
the  appropriate  photodiode  for  the  corresponding  receiver  and  clock  driver  circuit 
in  the  pair  being  compared. 

The  added  optical  pathlength  of  one  of  the  beams  allows  verification  of  the 
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Ser. 

Receiver 

Skew 

Ser. 

Receiver 

Skew 

No. 

pair 

(ns) 

No. 

pair 

(ns) 

1 

0,4 

10.0 

3 

0,4 

15.0 

3 

1,5 

20.0 

3 

2,6 

15.0 

3 

3,7 

20.0 

4 

0,4 

20.0 

4 

1,5 

15.0 

5 

2,6 

8.0 

5 

3,7 

8.0 

7 

2,6 

12.0 

9 

0,4 

17.5 

9 

1,5 

10.0 

9 

3,7 

10.0 

13 

1,5 

15.0 

13 

2,6 

5.0 

Table  8:  Measured  receiver  response  skew  between  transimpedance  receiver  and 
clock  driver  pairs. 


skew  measurement.  The  difference  in  pathlengths  corresponds  to  zin  external  de¬ 
lay  of  5.5  ns  for  one  of  the  optical  input  signals.  This  artificially  extends  the  skew 
between  the  outputs  of  the  two  receivers  by  a  known  amount.  The  measured 
skew  is  actually  the  sum  of  the  external  skew  due  to  the  difference  in  optical 
pathlengths  and  the  receiver  skew  due  to  fabrication  process  variation  of  transis¬ 
tor  parameters.  An  accurate  measurement  of  the  actual  receiver  response  skew 
can  be  performed  by  recording  the  skew  time  for  the  optical  beams  focused  to 
the  two  photodiodes  in  one  configuration,  then  reversing  the  configuration  and 
observing  the  results.  The  actual  receiver  response  skew  is  the  average  of  the  two 
measurements.  The  accuracy  can  be  confirmed  by  verifying  that  the  difference 
between  the  two  measurements  is  twice  the  difference  in  the  optical  pathlengths, 
or  11  ns. 

The  measured  data  are  shown  in  Table  8.  For  these  measurements,  the  peak 
optical  power  of  the  spots  received  through  the  mask  window  was  120  /xW  each. 
The  optical  signal  was  a  very  low  frequency  square  wave  with  50  per  cent  duty- 
cycle.  This  allowed  a  stable  continuous  trace  on  the  oscilloscope  with  a  period 
between  pulses  which  was  long  enough  for  the  stored  charge  on  the  load  capacitor 
to  decay  without  need  for  external  discharge  to  reset  the  test.  The  average  skew 
between  the  receiver  pairs  sampled  was  13.25  ns  and  the  variance  was  2.26  ns^. 
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Photodiode 

L 
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Figure  32:  Test  circuit  to  measure  leakage  current  between  photodiode  and  nearby 
storage  cell. 

The  range  of  measured  skew  values  was  5  -  20  ns.  The  skew  data  between  the 
receiver  pairs  not  listed  in  Table  8  were  unavailable  because  of  threshold  failure 
of  one  or  both  of  the  receivers  in  the  pair.  The  mean  skew  measured  is  about  four 
times  the  expected  propagation  delay  for  a  logic  gate  for  the  trauisistor  sizes  used 
in  the  test  circuit. 


5.4  Leakage  Current  Test 

The  leakage  current  test  is  composed  of  nine  optical  receivers,  each  with  a  dynamic 
latch  cell  positioned  near  the  photodiode.  A  diagram  of  a  typical  test  circuit  for 
determining  leakage  currents  is  shown  in  Figure  32.  The  separation  between  the 
photodiode  and  the  latch  cell  is  varied  from  one  receiver  to  the  next.  The  receivers 
in  this  test  have  been  designed  with  guard  rings  around  the  photodiodes  and  the 
associated  latch  cells.  The  photodiode  is  part  of  a  transimpedance  receiver  as  is 
the  case  in  the  other  test  circuits  on  the  chip;  however,  the  output  of  the  optical 
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Figure  33:  Optical  layout  for  leakage  current  measurement. 

receiver  for  this  test  is  used  only  to  aid  in  the  alignment  of  the  optical  beam  with 
the  photodiode  mask  window. 

The  optical  layout  for  the  leakage  current  test  is  shown  in  Figure  33.  Since 
no  modulated  signal  is  required  for  this  test,  the  acousto-optic  modulator  is  not 
used.  The  beam  is  focused  directly  onto  the  photodiode  mask  window.  The  beam 
can  be  blocked  by  an  opaque  screen  to  allow  measurement  of  the  dark  storage 
time  of  the  latch  cell  on  the  chip. 

In  this  test,  the  storage  time  of  each  dynamic  latch  is  observed  as  the  photo¬ 
diode  is  illuminated,  and  compared  to  the  dark  storage  time  to  give  an  indication 
of  the  leakage  currents  caused  by  diffusion  of  carriers  generated  by  deeply  pene¬ 
trating  photons  to  transistors  in  the  latch.  The  different  spacings  between  latch 
cells  and  photodiodes  allow  quantification  of  the  relationship  between  the  leakage 
current  strength  and  the  distance  carriers  must  diffuse  from  the  substrate  imder 
the  photodiode  to  reach  the  depletion  layer  of  the  latch  cell  pass  transistor  source 
region. 

If  the  pass  transistor  is  modeled  as  a  perfect  capacitor  and  the  leakage  cur¬ 
rents  are  modeled  as  independent  current  sources,  then  the  discharging  time  of 
the  capacitor  indicates  the  strength  of  the  total  current  which  transports  charge 
away  from  the  transistor  source  area.  For  normal  operation,  the  pass  transis¬ 
tor  has  a  very  small  leakage  current,  and  the  charge  remains  on  the  source  for 
a  number  of  milliseconds  before  decaying.  An  additional  leakage  current  due  to 
carriers  diffusing  to  the  source  area  depletion  region  from  the  photodiode  causes 
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Ser. 

No. 

Separation  (pm) 

18 

27 

37.5 

60 

1 

0.4 

1.24 

2.6 

5.0 

2 

1.6 

2.0 

3.25 

7.2 

3 

0.5 

1.48 

2.25 

5.7 

4 

0.5 

0.7 

3.0 

5.0 

5 

0.45 

1.2 

2.25 

4.5 

6 

0.55 

1.2 

2.5 

6.5 

7 

0.175 

0.9 

1.75 

4.5 

8 

0.3 

0.9 

1.8 

5.0 

9 

0.025 

0.55 

1.5 

3.6 

10 

0.42 

1.0 

2.0 

4.2 

11 

1.0 

1.6 

3.0 

5.5 

12 

0.4 

0.84 

2.0 

5.2 

13 

0.15 

1.4 

2.25 

5.5 

Table  9:  Storage  times  in  microseconds  for  latch  cell  with  2.2  mW  optical  power 
incident  on  photodiode  at  various  separations  to  latch  cell. 

the  charge  to  dissipate  from  the  pass  transistor  at  a  much  higher  rate.  This  rate 
is  proportional  to  the  number  of  carriers  present  at  the  photodiode,  and  therefore 
is  a  linear  fimction  of  the  optical  power  incident  on  the  photodiode.  If  the  sep¬ 
aration  between  the  photodiode  and  nearby  transistor  depletion  regions  is  large 
enough,  many  of  the  carriers  which  diffuse  in  the  direction  of  these  devices  recom¬ 
bine  before  reciching  the  nearby  transistor.  Increeising  the  separation  allows  more 
carriers  to  recombine;  therefore,  the  leakage  ctirrent  is  reduced. 

Table  9  lists  the  storage  times  in  microseconds  for  the  range  of  spacings  be¬ 
tween  photodiodes  and  latch  cells  for  the  sample  of  test  chips.  A  plot  of  these 
values  is  shown  in  Figure  34.  .’hijse  mecisurements  were  performed  using  an  op¬ 
tical  wavelength  of  633  nm.  1  t  •  storage  times  would  be  considerably  shorter  for 
an  incident  optical  wavelength  73v?  f’m,  since  a  much  larger  percentage  of  photons 
would  penetrate  into  the  s  ’bstiu-  beyond  the  depletion  layer  of  the  photodiode 
and  there  would  be  therefore  move  carriers  to  diffuse  to  the  nearby  latch  cell.  The 
maximum  available  optical  power,  2.2  mW,  was  used  for  the  meaisurements  in  this 


Figure  34; 
diode  and 
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test.  Since  the  leakage  current  increases  linearly  with  the  incident  optical  power, 
the  storage  time  is  inversely  proportional  to  the  optical  power,  and  the  storage 
time  for  a  lower  level  of  input  light  can  be  interpolated  by 


^  p 

opt.2 


(85) 


where  t\  is  the  known  storage  time  corresponding  to  an  incident  optical  power 
Popt,\y  Popta  the  new  power  level,  and  #2  is  the  new  storage  time.  Equation  85 
is  valid  as  long  as  the  dark  leakage  current  through  the  pass  transistor  is  small 
compared  to  the  leakage  current  due  to  the  illumination.  Using  the  data  from 
serial  number  1  in  Table  9  and  the  relationship  in  Equation  85,  an  incident  optical 
power  of  100  pW,  has  an  expected  storage  time  of  aroimd  9  ps  for  a  separation 
of  18  microns,  and  a  storage  time  of  0.1  ms  for  a  separation  of  60  microns.  By 
limiting  the  incident  optical  power  and  requiring  a  minimum  spacing  design  rule, 
the  storage  time  for  dynamic  latch  cells  in  the  vicinity  of  a  photodiode  can  be 
maintained  to  within  system  specification. 


The  test  chip  and  data  that  have  been  presented  in  this  chapter  demonstrate 
some  of  the  limits  in  the  usability  of  a  transimpedance  optical  receiver  for  clock 
distribution  that  were  anticipated  based  on  the  initial  investigation  presented  in 
Chapter  3.  The  layout  area  cost  per  receiver  is  very  large,  while  the  maximum 
operating  frequency  for  a  3-micron  implementation  has  been  shown  to  be  limited 
to  a  range  of  10  -  20  MHz.  While  a  2-micron  implementation  would  be  expected 
to  operated  at  a  higher  frequency,  the  improvement  would  not  be  greater  than  a 
factor  of  2;  therefore,  a  2-micron  design  could  be  expected  to  have  a  maximum 
operating  frequency  less  than  40  MHz.  In  addition,  the  response  skew  was  found 
to  be  appreciable.  The  measured  response  skew  range  was  5  -  20  ns,  with  20 
ns  representing  about  6  times  the  average  propagation  delay  for  a  logic  gate  on 
the  chip.  Clock  distribution  systems  based  on  metal  wiring  can  provide  coverage 
for  a  VLSI  chip  with  appreciably  less  skew,  indicating  that  the  transimpedance 
receiver  for  optical  clock  distribution  cannot  provide  improved  skew  performance. 
These  limits  are  not  fundamental  to  the  optical  clock  distribution  approach,  but 
rather  to  the  type  of  optical  receiver  that  has  been  implemented  for  the  test  chip. 
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A  phase-locked  loop  receiver  such  as  that  presented  in  Chapter  4  allows  improved 
performance  with  less  layout  area  cost. 

The  leakage  current  data  presented  in  this  section  represent  a  limitation  of 
the  optical  clock  distribution  approach  regardless  of  receiver  type,  when  the  con¬ 
straints  of  implementing  the  photodiode  8is  a  vertical  p-n  jvmction  with  existing 
CMOS  fabrication  steps  and  bias  voltages  are  observed.  The  problem  is  increased 
as  the  wavelength  of  illumination  is  increased,  due  to  the  larger  absorption  length 
of  photons  for  longer  wavelengths  of  light.  Based  on  the  data  presented  in  this 
section,  the  storage  time  for  dynamic  latch  cells  can  be  preserved  at  a  cost  of 
increased  separation  between  devices  or  a  reduction  in  optical  power.  This  also 
indicates  that  a  phase-locked  loop  receiver  is  preferable  since  operation  of  this 
type  of  receiver  requires  much  lower  optical  input  power. 
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Chapter  6 
Conclusions 


A  design  and  analysis  of  an  optical  clock  distribution  system  have  been  presented. 
A  summary  of  contributions  is  given  in  Section  6.1,  followed  by  suggestions  for 
future  work  in  Section  6.2. 


6.1  Summeiry  of  Contributions 

This  investigation  has  involved  the  design  of  an  optical  clock  distribution  system 
for  silicon  VLSI  computer  chips.  A  general  description  of  the  clock  distribution 
system  has  been  presented  in  Chapter  2  along  with  the  goals  and  constraints  for 
the  system  design  and  the  engineering  trade-olFs  that  are  inherent  in  them.  This 
system  uses  a  free-space  three-dimensional  approach  in  which  the  optical  clock 
signal  is  mapped  from  an  off-chip  light  source  via  a  holographic  or  lenslet  array 
element  to  several  photodetectors  integrated  on  the  silicon  chip  surface.  In  the 
analysis  presented  in  this  thesis,  the  effect  of  crosstalk  from  secondary  reflections 
is  not  considered. 

In  Chapter  3,  the  first  of  two  receiver  designs  has  been  presented.  The  transim¬ 
pedance  receiver  presented  has  been  adapted  from  a  well-established  optical  com- 
mimication  approach.  It  has  been  shown  by  circuit  simulation  experiments  and 
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noise  jitter  analysis,  however,  that  an  implementation  of  this  receiver  in  CMOS 
for  integrated  detection  of  optical  clock  signals  does  not  offer  a  substantial  im¬ 
provement  in  performance  over  existing  metal  wire  clock  distribution  methods. 
The  maximum  operating  frequency  for  a  2-micron  implementation  of  the  trans- 
impedance  receiver  and  clock  driver  is  approximately  50  MHz,  and  is  limited  by 
the  response  skew  between  receivers  implemented  in  different  locations  on  a  given 
chip. 

The  poor  performance  of  the  transimpedance  receiver  has  led  to  the  develop¬ 
ment  of  the  phase-locked  loop  receiver  presented  in  Chapter  4.  Circuit  simulations 
presented  show  that  this  receiver  can  operate  at  frequencies  of  100  -  200  MHz  with 
picosecond  steady-state  response  skew.  In  addition,  the  noise  analysis  presented 
shows  that  the  phase-locked  loop  can  operate  with  sub-picosecond  signal  jitter. 
The  comparison  of  optical  power  requirements  for  the  two  receiver  examples  pre¬ 
sented  shows  that  the  phase-locked  loop  receiver  can  meet  system  timing  speci¬ 
fications  with  much  lower  incident  optical  power  levels  than  the  transimpedance 
receiver. 

A  CMOS  chip  to  measure  three  parameters  of  an  integrated  optical  receiver 
and  photodiode  have  been  presented  in  Chapter  5.  This  test  chip  includes  tests  to 
quantify  the  maximum  operating  frequency  and  response  skew  for  a  typical  trans¬ 
impedance  receiver  with  3  micron  channel  length  transistors,  as  well  as  measure 
the  leakage  current  between  an  integrated  photodiode  and  nearby  dynamic  stor¬ 
age  devices.  The  maximum  operating  frequency  of  the  transimpedance  receiver 
has  been  measured  to  be  approximately  15  MHz  for  the  3-micron  implementa¬ 
tion,  while  the  skew  between  output  signals  of  paired  receivers  ranges  from  5  - 
20  ns.  The  leakage  current  between  the  carriers  generated  in  the  substrate  under 
the  photodiode  can  be  reduced  by  limiting  the  incident  optical  power  or  requiring 
a  minimum  separation  between  the  photodiode  and  sensitive  devices.  A  storage 
time  for  a  nearby  dynamic  latch  cell  of  at  least  0.1  ms  can  be  maintained  with  a 
minimum  spacing  of  60  microns  zind  a  maximum  optical  input  power  of  100  /iW. 
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6.2  Suggestions  for  Future  Work 

The  work  presented  here  represents  an  initial  effort  in  demonstrating  the  feasi¬ 
bility  of  optical  clock  distribution  for  VLSI  chips.  The  phase-locked  loop  receiver 
presented  in  Chapter  4  should  be  implemented  in  a  test  chip  and  the  operating 
parameters  of  the  receiver  such  as  the  conversion  gains,  the  range  of  input  fre¬ 
quencies  to  which  to  loop  will  lock,  and  the  difference  in  steady-state  phase  errors 
between  identical  receiver  copies  should  be  measured. 

In  addition  to  measuring  the  phase-locked  loop  receiver  parzuneters,  a  test  con¬ 
figuration  in  which  the  clock  is  distributed  optically  to  several  receiver  sites  on  a 
chip  should  be  implemented.  Such  a  configuration  would  require  the  development 
of  an  optical  mapping  element,  and  would  give  an  indication  of  the  production 
and  alignment  problems  inherent  in  the  incorporation  of  a  hologram  or  lenslet  ar¬ 
ray  into  the  optical  distribution  system.  The  electronic  logic  circuits  on  the  chip 
receiving  the  optical  clock  signal  should  be  an  existing  CMOS  system,  adapted  to 
include  phase-locked  loop  optical  receivers  and  local  polysilicon  clock  distribution 
within  functional  cells. 

A  further  investigation  of  the  leakage  current  and  response  time  for  the  pho¬ 
todiode  using  near  infrared  illumination  should  be  performed.  Since  a  realistic 
package  configuration  for  the  optical  clock  distribution  system  would  probably  in¬ 
volve  a  semiconductor  optical  source,  the  detection  parameters  for  illumination  at 
wavelengths  available  with  Iziser  diodes  should  be  measured,  so  that  the  feasibility 
of  using  coherent  light  and  a  holographic  mapping  element  can  be  determined. 

Finally,  an  investigation  of  the  effects  of  secondary  reflections  of  the  light  beam 
should  be  performed.  Since  the  analysis  presented  here  neglects  these  effects,  this 
zissumption  should  be  verified  or  the  effects  should  be  characterized  so  that  a  more 
detailed  analysis  can  be  performed. 
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Introduction 


In  conventional  VLSI^  design,  the  data  path  contains  a  combination  of  syn¬ 
chronous  and  asynchronous  logic.  Typically,  combinational  logic  is  separated 
by  latches  (registers)  which  hold  intermediate  results;  and  each  latch  is  activated 
by  a  global  clock  signal.*  Therefore,  the  clock  signal  is  one  of  the  most  heavily 
loaded  and  most  widely  distributed  signals  on  the  chip  surface.  With  current  de¬ 
vice  sizes  and  geometries,  the  delay  associated  with  propagation  down  a  length 
of  wire  dominates  the  delay  associated  with  swicching  devices  [6,  pp.  200-231]. 
Hence,  the  delay  from  when  the  clock  actually  switches  until  when  the  gate  of 
interest  switches  is  dominated  by  effects  of  the  wire  path  connecting  the  gate  to 
the  clock  driver.  This  delay  is  proportional  to  the  length  of  the  wire  path  and 
will  therefore  be  different  for  gates  located  at  different  distances  from  the  clock 
driver.  The  difference  in  “arrival  times”  of  the  clock  signals  at  the  various  gates 
is  referred  to  as  clock  skew,  and  the  maotimum  clock  skew  is  one  limit  on  the 
operating  frequency  of  the  clock. 

In  a  free-space  optically  distributed  clocking  scheme,  the  clock  signal  is  gen- 


’  Very  I.ar;e  Scale  Inte^racion 

^  .\ciuaUy,  most  clocking  schecnea  feature  muiti-phosed  clocks.  Therefore,  strictly  speaking, 
the  latches  ore  driven  by  phases  of  the  clock. 
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erated  (using  an  off-chip  laser  or  LED)  and  mapped  optically  onto  various  sites 
on  the  chip  s  surface.  Here,  receivers  convert  the  optical  clock  into  an  electrical 
clock  signal  and  distribute  it  locally  on  the  silicon  surface.  Since  the  speed  of 
light  is  vastly  greater  than  the  speed  of  chip  signal  propagation,  the  optical  por¬ 
tion  of  this  system  is  essentially  skew-free.  With  the  chip  divided  into  regions 
which  are  serviced  by  local  clock  receivers,  the  maximum  distance  a  signal  must 
electrically  propagate  is  reduced.  Hence,  the  maximum  difference  in  clock  signal 
arrival  times  is  reduced  and  the  system  should  be  capable  of  operation  at  higher 
clock  frequencies,  assuming  that  the  skew  introduced  (between  local  clocks)  by 
the  optical  receivers  is  not  excessive. 

As  detailed  by  Clymer  [2],  optically  distributed  clocking  may  indeed  be  a 
feasible  alternative  to  electrically  distributed  clocking  in  regard  to  VLSI  cir¬ 
cuits  fabricated  on  silicon.  To  be  feasible  for  such  implementation,  an  optically 
distributed  clocking  system  should  be  able  to  match  or  exceed  global  system 
clock  rates  attainable  using  conventional  electrically  distributed  clocking.  Fur¬ 
thermore.  to  minimize  the  increased  cost  of  such  an  optical  clock  scheme,  it 
is  desirable  to  be  able  to  fabricate  the  necessary  receiving  circuitry  using  a 
standard  silicon  technology.^  In  addition,  an  optical  clock  scheme  should  avoid 
necessitating  the  use  of  regularly  spaced  optical  receiver  circuits,  since  such  a 
requirement  conflicts  with  standard  VLSI  design  practice  of  laying  out  regular 
functional  blocks  and  then  placing  these  blocks  in  locations  which  attempt  to 
optimize  system  performance  but  which  are  not  necessarily  arranged  in  a  regular 
grid.  Finally,  the  silicon  area  occupied  by  the  optical  clock  receivers  should  be 
small  enough  that  useful  circuits  can  still  be  fabricated  on  the  remaining  silicon 
surface. 

Clymer  has  made  two  attempts  to  design  optical  clock  systems  which  meet 
the  above  criterion.  The  first  design  used  an  established  transimpedance  receiver 


^Noie  that  this  precludes  the  use  of  p-i-n  photodiodes,  hybrid  phoiodetector/VLSI  pack¬ 
ages,  and/ or  special  implants  to  modify  the  photosensitivity  of  the  silicon  surface. 
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to  detect  the  optical  signal  and  then  used  a  series  of  analog  stages  to  amplify 
the  signal  to  TTL  signal  levels.  A  series  of  test  chips  were  fabricated  in  the 
DARPA-sponsored  MOSIS  3^m  n-well  CMOS  technology.  The  test  chips  had 
an  average  maximum  operating  frequency  of  15MHz  and  an  average  receiver 
skew  of  13.25ns.  Unfortunately,  the  transimpedance  receiver  circuits  required  a 
large  amount  of  silicon  surface  area  and  did  not  show  marked  improvement  over 
existing  electrically  distributed  clocking  approaches,  and  hence  did  not  seem  to 
represent  a  viable  alternative  to  electrically  distributed  clocking. 

Clymer's  second  design  utilizes  a  phase-locked  loop  approach  to  the  problem 
of  receiving  an  optically  distributed  signal.  In  this  approach,  local  oscillators 
located  in  various  places  on  the  chip  surface  are  used  to  generate  the  clock  signal; 
and  the  optical  signal  is  used  to  synchronize  these  local  oscillators’  clocks  via 
phase-locking.  (See  section  2.1  for  details.)  Since  the  optical  signal  is  only- 
used  to  synchronize  the  local  oscillators,  it  doesn’t  need  as  much  amplification 
as  it  would  to  drive  the  circuitry  associated  with  a  transimpedance  approach. 
With  less  amplification,  there  is  less  of  a  chance  of  introduced  skew  (due  to 
cross-chip  process  variations)  between  local  clocks.  Hence,  the  phase-locked 
loop  approach  should  result  in  less  skew  and  should  therefore  allow  a  higher 
operating  frequency.  Clymer's  simulations  indicate  that,  when  fabricated  in 
2/im  CMOS,  such  a  design  should  be  capable  of  operation  with  a  maximum 
frequency  in  the  range  of  100-200MHz.  It  is  the  subject  of  this  thesis  to  report 
on  an  attempted  implementation  of  a  modification  of  Clymer’s  design  using  the 
DARPA  sponsored  MOSIS  2/im  n-well  CMOS  technology. 

In  summary,  a  phase-locked  loop  optical  receiver  has  been  designed  subject 
to  the  criteria  of  using  standard  silicon  fabrication  technology,  allowing  random 
clock  receiver  locations,  and  of  minimizing  the  use  of  silicon  surface.  This 
design  is  a  modification  of  Clymer’s  proposed  phase-locked  loop  design;  such 
modifications  being  necessary  to  allow  implementation  in  the  chosen  technology. 
Results  of  simulations  of  this  circuit  are  provided,  and  test  chips  implementing 
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Chapter  2 

Design  of  Test  Chip 


To  test  the  phase  locked-loop  design,  it  was  not  only  necessary  to  fabricate  tf 
local  Optical  Receiver/Clock  Amplifier  (ORCA)  cells,  but  it  was  also  necessary 
to  provide  diagnostic  circuitry  on  the  chip.  This  diagnostic  circuitry  is  comprised 
of  skew  detection  units, ^  frequency  division  units,^  and  functional  test  units. ^ 

The  Optical  Clock  Distribution  II  (OCD2)  chip  as  submitted  to  MOSIS 
contains  9  ORCA  units,  12  skewdetector  circuit  blocks,  1  multiplexer/ frequency 
divider  unit,  and  4  shift  register/test  units.  The  9  ORCA  units  form  a  3  by  3 
array;  and  between  each  two  horizontally  or  vertically  adjacent  ORCAs  is  a 
skewdetector  block.  The  other  circuitry  is  located  on  the  array's  periphery. 
(See  figure  2.1.) 

Each  circuit  block  will  now  be  described  in  detail. 


'The  skew  detectors  are  designed  to  check  for  skew  between  two  adjacent  ORCA  units  by 
comparing  corresponding  non-overiapping  clock  phases. 

^The  frequency  dividers  are  provided  to  permit  measurement  of  the  ~100MHi  clock  fre¬ 
quency  usjng  output  pins  with  a  maximum  operating  frequency  of  approximately  i:-20MHz. 

^The  functional  test  units  are  designed  to  insure  that  the  clocks  are  capable  of  clocking 
useful  computational  circuitry,  as  well  as  to  place  a  "load"  on  selected  clocks. 
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Shift  Register 

Test  Units  Skewdetectors 


Figure  2.1;  Block  diagram  layout  of  OCD2  chip. 


Figure  2.2:  Block  diagram  of  Clymer's  phase-locked  loop  design 

2.1  The  Optical  Receiver/Clock  Amplifier  cells 

Each  Optical  Receiver/Clock  Amplifier  unit  incorporates  a  phase-locked  loop 
approach  to  the  problem  of  receiving  an  optically  distributed  clock  and  dis¬ 
tributing  it  locally  on  a  silicon  chip.  The  phase-locked  loop  used  is  a  hybrid 
approach  in  which  analog  circuits  are  used  for  phase  detection  and  filtering, 
while  a  digital  circuit  is  used  for  the  actual  oscillator.  A  block  diagram  based 
on  Clymer's  work  [2,  pp.  44]  is  presented.  (See  figure  2.2.)  However,  due  to  re¬ 
strictions  imposed  by  the  standard  technology,  several  modifications  to  Clymer’s 
design  were  necessary.  These  modifications  will  be  noted  m  each  subcircuit  of 
the  ORCA  is  now  detailed. 

2.1.1  The  Voltage  Controlled  Oscillator 

The  VCO  is  basically  a  digital  ring  oscillator  comprised  of  five  inverters  with 
one  transmission  gate  in  the  loop.  (See  figure  2.3.)  The  transmission  gate  has 
its  NMOS  gate  fed  with  a  control  signal  CONTR,  and  the  PMOS  gate  is  driven 
by  a  signal  CONTR—  generated  by  passing  CONTR  through  a  minimum  size 
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CONTR-- 


Figure  2.3;  B2isic  digital  voltage  controlled  oscillator 

inverting  amplifier.  Hence,  the  signal  CONTR  acts  voltage  control  on  the 
equivalent  resistance  of  the  transmission  gate,  which  in  turn  acts  as  a  control 
on  the  ring  oscillator's  frequency.  SPICE  simulations  indicate  that  this  design 
can  provide  an  oscillator  frequency  in  excess  of  200MH2  while  insuring  that 
the  oscillator  will  not  reach  a  stable  state  under  worst-case  conditions  of  input 
voltage  and/or  fabrication  variations.'* 

The  oscillator  actually  implemented  has  had  its  frequency  adjusted®  to  be 
approximately  lOOMHz  when  the  control  voltage  CONTR  is  approximately  4.2 
volts.® 

Note  that  this  VCO  design  is  slightly  modified  from  Clymcr’s  [2,  page  44] 
in  that  four  of  the  pass  transistors  have  been  removed  and  the  fifth  has  been 
changed  to  a  transmission  gate.  By  changing  the  pass  transistor  to  a  transmis- 

*Note  that  there  must  be  an  odd  number  of  inverters  in  the  ring  oscillator  to  prevent  it 
from  settling  to  a  steady-state.  Furthermore,  simulations  indicate  that  the  feedback  of  a  ring 
oscillator  with  fewer  than  five  inverters  could  result  in  its  reaching  a  stable  state  (such  as 
2.5V)  rather  than  the  oscillation  (between  0  and  5  V)  which  i.'  desired. 

^Adjustment  is  done  via  adjustment  of  the  lengths  and  widths  of  the  transistors  in  the 
transmission  gate  to  affect  its  equivalent  resistance. 

'This  is  due  to  requirements  imposed  by  the  photodetector/ amplifier  block  and  will  be 
explained  later  (on  page  19). 
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sion  gate,  we  necessitate  the  generation  of  a  second  control  signal  CONTR— 
However,  we  gain  a  significant  speed  advantage  in  the  ring  oscillator  since  the 
pass  transistor  suffers  the  well-known  “threshold  drop”  problem,  while  the  trans¬ 
mission  gate  can  pass  signals  over  the  entire  voltage  range.  Also,  a  pass  tran¬ 
sistor  would  require  that  CONTR  always  be  above  approximately  3.3  volts  in 
order  to  keep  the  five  inverters  of  the  ring  from  becoming  decoupled;  but  by 
using  a  transmission  gate,  the  range  of  voltages  over  which  CONTR  can  vary 
increases  slightly. 

Furthermore,  removing  four  of  the  pass  transistors  increases  the  maximum 
operating  frequency  of  the  oscillator  at  the  expense  of  decreasing  the  range  of 
frequencies  over  which  the  loop  can  be  operated.  However,  as  we  shall  see,  the 
photodetection  circuitry  already  imposes  a  limit  on  the  range  of  frequencies  over 
which  the  oscillator  can  operate.  Thus,  we  can  increase  the  maximum  operat¬ 
ing  frequency  of  the  circuit  without  significant  penalty  (in  terms  of  operating 
frequency  range.) 

By  modifying  Clymer's  design  as  described  above  and  by  ratioing  the  inverter 
dimensions  to  provide  increased  drive  current  to  the  transmission  gate  while 
presenting  it  with  a  near  minimal  load.**  the  maximum  frequency  of  the  ring 
can  be  increased  from  approximately  6TMHz^  to  over  200MHz,  according  to 
SPICE  simulations. 

2.1.2  The  Loop  Filter  and  Low  Pass  Filter 

As  noted  by  Clymer{2,  pp.  44-53]  and  detailed  in  Gardner  [3],  the  use  of  a  simple 
lag-lead  filter  is  well  understood  in  phase-locked  loop  theory.  Furthermore,  it 
has  reasonable  layout  requirements  and  a  small  steady-state  phtise  error.  Hence, 
Clymer’s  choice  of  a  loop  filter  design  is  used  here.  As  for  the  low  pass  filter, 

■  CONTR~  should  be  approxinuteiy  V^^^-CONTR. 

^Minimal  in  the  senae  of  paraaiiic  capacitance  which  must  charge  through  the  T-gate. 

’Simulation  of  Clymer's  design  using  the  SPICE  models  indicated  in  Appendix  B. 


9 


RlslOOk  R2=31.4k 

\  \ 


LF.Input 


COI^  CONTO - 1 


Cl=.5pF  /  C2=JpF 

R3=32.3k 


5v-inout 


Figure  2.4;  "‘Ideal™  low-pasa/lag-lead  filter  combination. 


we  are  subject  to  three  basic  criteria:  First,  we  need  to  filter  out  the  lOOMHz 
signal  from  the  phase  detector's  comparison,  for  residual  signal  will  result  is 
unwanted  variation  on  the  control  voltage  of  the  VCO  causing  jitter  in  the 
VCO’s  output  frequency.  Second,  we  do  not  want  to  unduly  increase  the  lock-in 
time*®  of  the  phase-locked  loop  by  making  the  RC  time  constant  of  the  low  pass 
filter  too  large.  Finally,  we  must  be  able  to  implement  the  filter  using  standard 
technology,  which  limits  the  range  of  component  values  which  we  may  use. 

In  answer  to  these  criteria,  a  simple  RC  low  pass  filter  has  been  designed 
with  its  breakpoint  set  at  r  =  20MHz.  This  provides  adequate  filtering  of 
the  lOOMHz  signal,  and  it  can  be  implemented  using  a  100kI2  resistor  and  a 
0.5pF  capacitor.  .4  diagram  of  the  ■‘ideal™  “  cascaded  low  pass/lag-lead  filter  is 
provided  in  figure  2.4,  and  a  frequency  response  plot  for  this  filter  is  shown  in 
figure  2.5. 

However,  one  must  remember  that  we  are  working  in  a  technology  which 
does  not  easily  provide  capacitors  and  resistors — instead  we  use  transistors  to 
approximate  the  characteristics  of  these  devices.  (See  figure  2.6  for  transistor 
implementation  of  circuit.)  In  doing  so.  the  linear  approximation  (of  modeling 


^°Time  from  when  the  circtui  is  switched  on  until  the  VCO  “lociu''  to  the  optical  si^al. 
“Ideal”  is  used  in  the  sense  that  the  circuit  shown  assumes  ideal  resistors  and  capacitors, 
rather  than  the  transistors  which  must  be  used  to  implement  them. 
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Figure  2.6:  Low-pasa/lag-lead  filter  combination  as  actually  implemented  on 
the  OCD2  chip. 


these  transistors  as  resistors  and  capacitors)  is  reasonably  valid  over  only  a 
limited  range  of  input  voltages  to  the  filter.  Fortunately,  we  intend  to  operate 
the  filter  in  such  a  way  that  the  output  voltage  (CONTR)  is  approximately 
constant  at  4.2  volts. If  we  assume  that  the  output  is  at  a  steady  state 
value  of  4.2  volts,  the  “capacitor’  C2  has  a  constant  gate  bias  of  approximately 
4.2  volts,  which  indicates  that  it  stays  ir  the  “linear”  regime  of  operation  and 
therefore  has  a  constant  capacitance  [11,  4].  Also,  as  long  as  the  “capacitor”  C3 
has  a  gate  voltage  above  its  threshold  voltage  (approximately  0.8  volts),  it  too  is 
in  the  linear  regime  of  operation.  Therefore,  it  is  valid  to  model  the  transistors 
used  to  implement  C2  and  C3  as  linear  capacitors. 

As  for  resistances,  R3  has  a  very  small  V05  and  a  (  Vos  (  of  4.2  volts.  Thus. 


'^Thia  ne^ects  the  “power  on”  charging  of  the  capacitors  of  the  filter.  When  the  chip  is 
first  powered  on.  the  capacitors  will  be  non-linear,  but  once  they  reache  a  steady  state,  the 
approximations  mentioned  above  become  valid. 

*^This  is  for  the  PMOS  htdf  of  the  resistor.  The  NMOS  half  has  a  gate-source  voltage  of 
about  .8  volts,  which  means  it  is  just  barely  “on  '  and  has  a  high  resistance.  Hence  the  parallel 
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the  ‘resistor  ’  is  in  the  linear  regime,  and  has  a  nearly  linear  resistance  over  the 
small  range  of  os  experiences.  Over  the  voltage  range  of  interest,  therefore, 
Ra  can  be  reasonably  modeled  as  a  linear  resistor.  R}  is  under  conditions  which 
are  similar  to  those  of  Ra,  and  can  reasonably  be  considered  a  linear  resistor 
for  identical  reasons.  Ri  is  a  more  complicated  case.  Both  the  source  and 
the  drain  of  this  resistor  experience  variations.  However,  the  variations  on  the 
LF  Jnput  side  of  the  resistor  are  generally  larger  than  those  at  the  other  side. 
Hence,  we  shall  assume  the  voltage  on  the  capacitor/ resistor  junction  side  of  Ri 
is  constant  at  4.2  volts.  Even  with  this  assumption,  however,  the  voltage  V os 
has  a  worst-case  range  of  approximately  (-3.2V,-i-I.0V).  Over  this  range,  Ri 
exhibits  non-linear  characteristics.  If  one  assumes  that  the  loop  locks  to  a  value 
where  the  input  voltage  range  is  smaller,  one  can  make  linearization  arguments, 
but  this  assumption  is  hardly  valid  in  all  cases.  Furthermore,  the  voltage  range 
is  lessened  by  the  loading  of  the  network,  and  this  would  tend  to  make  the 
resistance  characteristics  of  Ri  more  nearly  linear.  However,  Ri  is  still  basically 
non-linear  in  characteristic — we  merely  model  it  as  linear  to  get  an  estimate  of 
the  filter's  performance  and  then  check  its  actual  performace  at  the  frequencies 
of  interest  using  SPICE  simulation. 

Another  factor  to  consider  when  designing  the  filter  is  that  the  SPICE  mod¬ 
els  currently  available  from  MOSIS  appear  to  be  off  by  a  factor  of  two  when 
modeling  resistance  and  capacitance,  according  to  Burr  [1],  Hence,  we  must 
restrict  the  range  of  voltages  entering  the  low  pass  filter  to  fail  withing  a  range 
in  which  the  linear  approximation  is  reasonably  valid  and  we  must  be  careful 
that  a  ■‘factor  of  two”  error  in  capacitance/resistance  values  will  not  cause  the 
circuit  to  malfunction.  The  two  worst-case^®  transfer  curves  for  the  ideaP*  fil- 

PMOS  transisior’i  resistance  dominates. 

’*The  (-3.2  V, 4.1.0 V]  rom^e  comes  from  SPICE  simulationa  from  the  unloaded  output  of  the 
.NANO  sate  comparator  described  in  the  next  section.  It  is  reasonable  to  assume  that  this 
voltage  range  would  be  decreased  by  the  loading  of  the  NANO. 

’^subject  to  the  constraint  of  keeping  RC  constant,  since  the  charging  time  estimates  pro¬ 
vided  by  MOSIS  appear  to  be  accurate. 

‘*i.e.  comprised  of  capacitors  and  resistors  rather  than  transistors. 
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Figure  2.7:  Transfer  curve  for  low  resistance/high  capacitance  values. 

ter  are  shown  in  figures  2.7  and  2.3;  and  they  appear  identical  to  the  nominal 
ideal  filter.  As  the  linear  approximation  of  the  current  design  appears  adequate, 
SPICE  simulations  of  the  design's  response  to  sinusoidal  inputs  of  lOOMHz  and 
12.8MHz^^  were  performed  and  the  results  are  presented  in  figures  2.9  and  2.10. 
In  the  SPICE  simulation  of  the  12.8MHz  signal,  we  can  see  that  it  is  attenu¬ 
ated  somewhat  by  the  filter.  However,  to  keep  the  lOOMHZ  signal  well  filtered, 
we  cannot  easily  reduce  the  attenuation  of  the  12.8MHz  signal.  Thus,  there  is 
some  unavoidable  increase  in  lock-in  time  of  the  loop  (over  that  indicated  by 
Clymer’s  simulations.)  However,  once  the  clock  is  locked  to  the  optical  signal. 


*^Clyiner  [2,  pai(e  52]  calculated  a  '‘pull  in'*  ranfe  of  12.SMHz  for  his  original  design.  The 
12.SMHZ  signal  plotted  here  is  not  severely  attenuated  by  the  implemented  low-pass/lag-lead 
Alter  combination.  Thus,  the  '‘pull  in”  time  of  the  loop  is  not  excessively  increased  by  our 
choice  of  low  pass  filter  and  by  non-linear  effects  of  the  transistors  used  to  implement  said 
filter  combination. 
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Figure  2.9;  Spice  simulation  of  filter  driven  by  lOOMHz  sinusoid. 
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Figure  2.10:  Spice  simulation  of  filter  driven  by  12.8MHz  sinusoid. 


the  “lock-in  time”  benchmark  loses  significance.  Furthermore,  in  most  applica¬ 
tions,  it  is  irrelevent  how  long  the  clock  takes  to  lock  to  the  optical  signal  as 
long  as  it  does  so  within  a  reasonable  time.^'*  Hence,  the  low  pass/loop  filter 
combination  presented  here  should  be  adequ,iie  for  most  applications. 


2.1.3  The  Photodetector  and  Phase  Comparator. 

The  photodetector/phase  comparator/amplifier  shown  in  figure  2.11  represents 
the  most  significant  modification  of  the  ORCA  design  presented  by  Clymer  [2] 
and  shown  in  figure  2.12. 

There  are  several  reasons  for  modifying  Clymer’s  circuit.  First,  although 
biasing  the  photodiode  using  the  output  of  the  VCO  does  simplify  the  problem 
of  phase  detection,  it  has  implementation  problems.  The  p-n  junction  used  for 
photodetection  in  the  circuit  is  comprised  of  a  20/um  by  20/im  p"*" -diffusion  in  an 


perhaps  a  traction  of  a  second. 
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Figure  2.11:  Photodetection/amplification/comparator  circuit  as  implemented 
on  OCD2  chip. 
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Figure  2.12:  Clymer’s  photodetection  and  phase  comparator  circuit 
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n-weil.^®  Due  to  the  size  of  this  diffusion,  it  has  significant  capacitance,  which 
causes  noticeable  loading  on  the  VCO,  thereby  lowering  its  operating  frequency. 
To  correct  this  problem,  increasing  the  VCO  inverter’s  sivjes  is  a  possibility, 
but  doing  so  markedly  increases  the  power  drawn  by  the  VCO.  Also,  there  is 
still  a  problem  with  this  design  due  to  the  reverse  recovery  transient  exhibited 
by  the  photodiode.  In  this  phenomenon  [8,  pp.  170-173],  the  stored  junction 
charge  in  the  p-n  diode  causes  a  large  reverse  current  to  flow  when  the  VCO 
output  switches  from  low  to  high.  As  an  example,  for  equal  pullup  and  pulldown 
resistor  values  of  63kn,  the  reverse  transient  will  have  a  peak  amplitude  of  68/iA. 
decaying  as  the  stored  junction  charge  is  extracted.  Since  68^A  is  almost  seven 
times  as  great  as  the  photocurrent  we  intend  to  detect,  the  reverse  recovery 
transient  poses  a  serious  problem.  Because  of  these  problems  we  abandon  the 
use  of  the  VCO  output  to  bias  the  photodetector,  instead  biasing  it  with  a 
ratioed  inverter  with  its  input  tied  to  its  output.  To  replace  the  lost  phase 
detection  ability  of  the  VCO  biased  photodiode,  we  provide  a  simple  digital 
phase  detector  later  in  the  circuit. 

Another  difference  between  the  current  design  and  Clymer’s  design  is  the  use 
of  an  amplifier/inverter  chain  as  opposed  to  a  simple  resistor  biwing  configura¬ 
tion.  This  difference  arises  due  to  technology’s  constraints,  since  the  resistors 
in  Clymer’s  design  must  be  on  the  order  of  IMO  to  get  the  desired  range  of 
control  voltages.^®  Note  one  cannot  normally  get  these  resistances  using  biased 
transistors.^^  Therefore,  we  resort  to  detecting  the  small  voltage  difference*^ 
produced  by  the  photodiode,  and  then  amplifying  this  voltage  swing  through 


‘*To  help  limit  the  amount  of  injected  current  that  reache*  •urroundinx  circuitry,  a  “p-plug’’ 
ring  i*  placed  around  the  n-well,  “tying”  the  substrate  strongly  to  ground. 

^®The  voltage  range  of  interest  is  ~3.5-5.0  volts. 

**  Special  processes  used  for  memory  chips  provide  a  highly  resistive  polysilicon  layer,  but 
the  resistances  typical  of  such  a  layer  are  on  the  order  of  IGO  and  have  large  variations  in 
resistance  values.  Hence,  even  these  special  processes  are  inadequate  for  this  application. 

**The  voltage  swing  produced  by  the  photodiode  is  approximately  300mV,  according  to 
SPICE  simulations.  This  voltage  swing  is  due  to  the  pbotocurrent  acting  through  the  bittsing 
transistors  and  is  limited  by  the  resistances  of  the  biasing  transistors,  the  parasitic  capacitance 
it  must  charge,  and  the  operating  frequency  of  the  clocJt. 
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Dark 

High 

mmmamm 

Illuminaced 

Low 

Illuminated 

High 

Low  1 

Table  2.1:  Error  voltage  as  a  function  of  illumination  and  VCO  output. 


two  analog  stages.  These  identical  linear  amplifiers  are  ratioed  in  such  a  way 
that  their  quiescent  poinl^^  lies  in  the  center  of  their  input  voltage  range  at  the 
anticipated  clock  frequency,  while  they  maintain  a  nearly  linear  voltage  transfer 
characteristic  over  the  input  voltage  range.  In  this  way,  the  amplifiers  in  the 
chain  maintain  an  average  voltage  equal  to  their  quiescent  voltage,  and  they  are 
prevented  from  entering  the  non-linear  region  of  their  voltage  transfer  charac¬ 
teristics.  Once  we  have  amplified  the  signal  sufficiently,  we  use  two  high-gain 
inverters  to  extract  the  “logic  threshold  crossing’*^  information  from  the  signal 
and  to  amplify  the  signal  to  CMOS  logic  levels.  Finally,  the  resultant  signal  is 
digitally  N  AND-ed  with  the  VCO  output  to  create  an  error  voltage.  The  result 
of  the  NAND  operation  is  shown  in  table  2.1. 

Spice  simulations  have  been  performed  to  determine  what  the  transistor 
dimensions  for  the  amplifier  chain  should  be  in  order  to  maintain  a  strong  reverse 
bias  on  the  photodiode,  to  provide  a  reasonable  gain  per  amplifier  stage,  and 
to  keep  the  voltages  involved  from  leaving  the  linear  region  of  the  amplifier's 
transfer  curve.  The  resulting  transistor  dimensions  are  noted**  on  figure  2.11. 

Since  this  part  of  the  ORCA  is  mainly  analog  circuitry,  one  becomes  con¬ 
cerned  with  the  effects  of  process  variations  on  its  functionality.  SPICE  simula- 

^^Here  quiescent  point  is  defined  as  that  voita^  to  whicli  the  amplifier  would  settle  if  its 
output  were  connected  to  its  input. 

‘*i.e.  the  time  when  the  amplified  si^al  crosses  the  lope  threshold  of  the  first  inverter. 
The  lope  threshold  of  the  first  inverter  is  set  to  2.31/. 

^^Notation  is  in  the  form  W/L.  where  W  is  the  drawn  ?ate  width  in  nm  and  L  is  the  drawn 
gate  length  in  um. 
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tions  were  done  to  determine  how  worst-case*®  uniform  variations  in  transistor 
geometries*'  and  threshold  variations*®  would  affect  the  output  of  the  phase 
comparator  at  varying  values  of  skew  between  optical  signal  and  VCO  output. 
Results  indicate  that  at  lOOMHz,  the  worst-case^®  “average"’  output  value  from 
the  NAND  gate  occurs  for  a  short  and  wide  channel  at  2ns  of  skew,  and  this 
value  is  approximately  3.4  volts.  In  order  to  accomodate  this  worst  case,  the 
VCO  is  adjusted  to  run  at  lOOMHz  when  the  control  voltage  CONTR  is  approx¬ 
imately  4.2  volts — midway  between  the  3.4  volts  of  worst-case  NAND  output 
and  the  5  volts  of  “perfect  lock”  NAND  output. 

In  addition  to  being  concerned  over  the  effect  of  process  variations  on  the 
functionality  of  this  circuit,  one  is  also  concerned  about  the  limits  that  this 
photodetector/amplifier  circuit  may  impose  on  the  phase-locked  loop’s  operat¬ 
ing  frequency  range.  SPICE  simulations  also  indicate  that  if  the  optical  clock 
frequency  is  too  high,  the  parasitic  capacitance  at  the  input  to  the  first  analog 
amplifier  will  not  charge  to  a  high  enough  voltage  to  cause  the  inverters  in  the 
chain  to  switch  fully.  Hence,  the  output  signal  of  the  NAND  gate  will  no  longer 
function  as  a  valid  error  signal.  On  the  other  hand,  if  the  optical  clock  fre¬ 
quency  is  too  low.  the  parasitic  capacitance  will  charge  too  much.  This  causes 
the  analog  amplifier  stages  to  saturate  and  effectively  lengthens  the  pulse  seen 
by  the  NAND  gate,  since  the  inverters  detect  threshold  crossings.  Beyond  a 
point,  this  pulse-widening  causes  too  much  loss  of  skew  information  between 
the  optical  signal  and  the  VCO  output,  resulting  in  uncertainty  of  lock  in  the 
loop.  (See  figure  2.13.)  Hence,  the  ORCA  no  longer  functions  properly.®®  Since 


Wont  CSM  variations  are  determined  by  usin^t  data  acquired  from  a  typical  MOSIS  2.0um 
CMOS  run.  See  .Appendix  C. 

\  reasonable  model  of  the  effects  of  misali^ed  fabrication  masks. 

Modelling  doping/ process  variation  effects,  to  hnt  order. 

^^The  wont-case  is  the  highest  low  voltage  produced  by  the  unloaded  NAND  gate,  as  this 
is  the  wont-case  bottom  of  the  control  voltage  range.  Five  volts  is  assumed  to  be  the  top  of 
the  voltage  range. 

^®With  continued  decrease  in  frequency,  the  voltages  "walk  off"  the  Linear  portion  of  the 
transfer  curve.  .At  this  point,  the  input  to  the  .NAND  gate  shows  only  small  ( <  1  volt)  variations 
and  these  variations  are  not  near  the  logic  threshold  of  the  NAND.  Hence,  the  .NA.ND  loses 
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'  '  •  puiM'WiO^otd  ooticaiiy  r*c*«v«o  sipnAj 

■■  •  VCO  ot/Qut  ttgrui 

-  <^<'•0  «i««et*i  Br  NANO  e<i.) 


VCO  ugnal  A 


VCO  Mgnal  B 


II  4  aviOsni  thd  ina  los  VCO  tignala  nava  a  dWaranca  ItKawl  olin  raaoaa 
to  aaen  oinar.  Tha  NANO  oumul  •  nign.  aieaoi  M  tha  ovanao  ragion.  Sliiea  Oia 
tvartga  NANO  output  valua  4  tna  aama  in  oom  caaaa,  tna  anor  voitaga  on  tha 
VCO  •  tha  aama  ana  tnatalora  tha  ooaraing  Itaouanoaa  at  VCO  A  and  VCO  B 
ata  tha  aama.  Houmvaf.  tha  outaa  wioaning  cauaad  by  too  low  an  ooticaj  cioca 
Iraauancy  aiiowa  unoo/roctaoia  aaaw  to  ai4l  Ooiwoan  tna  two  VCO  ciocaa. 


Figure  2.13;  Illustration  of  how  too  low  an  optical  clock  frequency  can  cause 
loss  of  skew  information. 


too  short  or  too  long  a  pulse  width  causes  unreliable  operation  of  the  loop,  we 
see  that  the  this  photodetector/amplifier/phase-comparator  circuit  imposes  a 
limit  on  the  frequency  range  over  which  the  ORCA  can  lock  to  an  optical  input 
signal.  Thus,  the  loss  in  frequency  range  imposed  by  removing  four  of  the  pass 
transistors  in  the  VCO,  as  mentioned  on  page  9,  is  not  a  serious  limitation.^^ 
One  final  difference  between  the  current  design  and  Clymer’s  design  should 
be  mentioned.  In  Clymers  design,  the  output  of  the  phase  detector  was  high 
ifP*  both  the  VCO  output  was  high  and  the  photodiode  wtis  illuminated.  In  the 
current  design,  the  output  of  the  phase  detector  is  low  iff  these  same  VCO  and 
photodiode  conditions  exist.  Hence,  whereas  Clymer’s  circuit  tended  to  “pull” 
the  VCO  into  lock  at  0®  li  (where  5  is  a  small  steady-state  phase  error),  the 
current  design  “pushes”  the  VCO  into  lock  at  180®  -I-  d.  This  should  pose  no 


aU  information  from  the  phoiodecector. 

SPICE  simulations  reveal  that  the  dominart  limitation  of  frequency  range  of  operation 
arises  from  the  photodetector/ amplifier  circuit,  rather  than  the  VCO. 

is  used  in  the  mathematical  convention  meaning  “if  and  only  if." 
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problem,  since  the  skew  between  the  optical  clock  and  the  VCO  is  unimportant; 
it  is  the  skew  between  VCO’s  of  different  ORCAs  on  the  same  chip  that  concerns 
us.  Furthermore,  the  average  output  of  the  NAND  ranges  from  ~3. 0-5.0  volts, 
which  is  ideal  for  controlling  the  VCO.  Hence,  the  NAND  output,  once  filtered, 
should  serve  well  as  the  control  voltage  for  the  ring  oscillator. 

Now  that  the  photodetector/amplifier/phase  comparator  has  been  described, 
we  examme  the  circuitry  necessary  to  convert  the  output  signal  of  the  VCO  into 
a  useable  system  clock  signal. 
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Chapter  3 

Testing  of  Chip 


One  possible  optical  setup  for  testing  the  0CD2  chip,  along  with  appropriate 
test  procedures,  is  now  detailed. 


3.1  Optical  Test  Setup 

To  facilitate  easier  testing,  only  one  optical  setup  should  be  used  for  all  of  the 
suggested  tests.  Note  that  this  would  help  to  maintain  consistency  between 
tests,  since  different  setups  might  cause  variations  in  optical  power  incident  on 
the  photodiode,  and  such  variations  could  affect  the  operation  of  the  ORCAs.^ 
The  suggested  optical  setup  in  shown  in  figure  3-1. 

As  Clymer  noted  [2,  pp.  15-17],  a  visible  (i.e.  He-Ne)  laser  source  would 
have  higher  efficiency  since  shorter  wavelength  light  results  in  rapture  of  more 
photons  in  the  photodiode’s  depletion  layer.  However,  most  visible  lasers  must 
be  externally  modulated  by  an  acousto-optic  light  modulator  (AOLM).  Since 
our  circuit  expects  5ns  pulses  (lOOMHz  square  wave),  we  cannot  afford  a  rise  or 
fall  time  of  more  than  about  Ins.  Thus,  we  would  require  an  AOLM  with  about 

'Thi»  is  due  to  the  previously  mentioned  sensitivity  of  the  photodiode's  amplifier  chain. 
See  page  20. 
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chip  mounted 
on  rotation  stage 
on  x-y-z  translator 


Figure  3.1:  Optical  setup  used  in  all  chip  tests, 

IGHz  of  bandwidth.  Such  AOLMs  are  extremely  expensive*  and  are  not  very 
efficient.  As  an  alternative,  a  laser  diode  system  can  be  directly  modulated,  but 
most  laser  diodes  operate  in  the  infrared  (at  approximately  820nm)  where  the 
p-n  photodiode  is  not  very  efficient.  However,  one  laser  diode  system  was  found 
which  features  a  ToOnm  beam,  a  modulation  input,  and  colliminating  optics. 
Thus,  the  capture  efficiency  is  better  than  for  most  laser  diode  systems,  and  we 
needn’t  worry  about  aligning  the  diode  to  collimating  optics.  Furthermore,  its 
specifications  (which  are  included  in  Appendix  D)  indicate  that  the  modulation 
input  has  a  range  of  lOOkHz-oOOMHz.  Since  500MHz  corresponds  to  Ins  each 
of  rise  and  fall  times,  this  laser  diode  system  could  serve  our  purposes  very  well. 

In  one  possible  setup,  the  laser  beam  emitted  from  this  system  would  fol¬ 
low  the  path  indicated  in  figure  3.1.  Note  that  each  beam  is  reflected  once 
and  transmitted  once,  so  each  should  have  the  same  optical  power  (neglecting 
mirror  losses)  even  if  the  beamsplitters  are  slightly  anomalous  in  their  trans- 

*on  the  order  of  S20.000. 
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mittance/reflectance  characteristics  at  this  wavelenijth.  (Sample  beamsplitter 
specificatioii  sheets  are  also  provided  in  Appendix  D.) 


After  leaving  the  beamsplitter  cubes,  the  beams  would  then  be  focused  by 
the  lens  into  two  small  spots  on  the  chip.  The  chip  can  be  translated  in  all  three 
dimensions  and  can  be  rotated  in  order  to  position  the  two  beams  so  that  they 
would  impinge  on  two  adjacent  ORCAs’  photodetectors  while  allowing  control 
of  the  spot  size.  The  separation  of  the  two  beams  could  be  controlled  using  the 
1-D  translator  on  the  second  mirror. 

Optical  power  considerations  for  this  suggested  setup  are  as  follows:  the 
suggested  laser  emits  up  to  4mVV  of  optical  power.  Each  sample  beamsplitter 
is  45%  transmitting/ 45%  reflecting.  Hence,  each  beam  of  light  has  SIO^W  of 
power  exiting  the  second  beamsplitter.  (We  neglect  losses  due  to  the  mirrors.) 
With  a  suggested  lens^  we  could  focus  the  4.5mm  .x  2.0mm  spot  down  to  a  27^m 
X  12^m  spot.  However,  to  make  sure  the  phoiodetector  is  fully  illuminated,  we 
would  defocus  the  beams  so  that  the  smallest  dimension  is  SO^im.  At  this  point, 
the  optical  power  incident  on  each  detector  is  approximately  Z24nW^  Since  the 
responsivity  of  the  p-n  junction  is  approximately  0.12-0.15  A/W  (neglecting 
reflection  losses  at  the  chip),  [5.  9,  pp.  743-756],  this  corresponds  to  a  photocur¬ 
rent  of  at  least  39.2/iA.  We  adjust  the  power  and/or  focusing  of  the  beam  to 
scale  this  photocurrent  down  to  10a<A,  since  this  is  the  value  we  used  in  the 
SPICE  simulations. 

We  now  proceed  to  detail  suggested  tests,  utilizing  this  suggested  optical 
setup. 


^The  lens  has  a  focal  length  of  TOmm  and  a  diameier  of  SOniin. 

’Since  the  phocodecector  is  square  and  the  incident  beam  in  elliptical  in  nature,  the  rest 
of  the  optical  power  would  be  lost  on  the  shielding  around  the  photodetector. 
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3.2  Pre-testing 

Before  attempting  to  test  any  given  chip  using  the  previously  described  optical 
test,  one  should  verify  the  functionality  of  ats  much  of  the  chip  as  possible  without 
the  optics. 

To  test  the  skewdetector  blocks,  the  clocks  should  be  allowed  to  run  at 
whatever  natural  frequency  they  settle  to  in  the  absence  of  an  optical  signal. 
An  oscilloscope  should  be  used  to  monitor  the  charging  characteristics  of  the 
skew  pins  (SkewO-Skewll)  while  periodically  activating  the  discharge  line.  Also, 
the  same  pins  should  be  monitored  while  the  calibrate  line  is  active.  In  this  way, 
‘stuck  at  zero’  and  “stuck  at  one’  faults  on  the  skew  pads  and  the  discharge 
pad  could  be  detected. 

For  the  frequency  divider  testing,  the  four  frequency  output  pins  (FreqOutO- 
FreqOut3)  should  be  monitored  while  the  various  select  lines  (Select3-Select0) 
are  set  in  turn  to  each  hexadecimal  value  from  0  to  3.  An  oscillation  on  the 
output  pins  with  a  frequency  in  the  “reasonable”  range  indicates  that  the  cor¬ 
responding  phase  of  the  corresponding  clock  is  oscillating  correctly  and  that 
the  multiplexor/ frequency  divider  circuit  is  operational  at  the  natural  clock  fre¬ 
quency.  Note  that  the  “Init”  line  needs  to  be  pulsed  before  the  frequency  divider 
is  guaranteed  to  function  properly. 

For  the  shift  register  testing,  one  should  monitor  the  four  error  pins  (ErrorS- 
ErrorO).  The  Init  line  should  be  pulsed  to  set  the  T  flip-flop  to  a  guaranteed 
operational  state,  and  after  a  short  while  (roughly  100  clock  cycles)  the  shift 
register  should  have  loaded  with  alternating  ones  and  zeros.  From  this  point 
on.  there  should  be  no  toggling  of  the  error  lines.  Observation  of  the  turn-on 
transient  of  this  line  would  verify  the  lack  of  “stuck  at”  faults  in  the  shift  register 
circuit,  and  observation  of  this  line  after  the  shifter  is  loaded  would  verify  that 
the  shift  register  test  is  operating  correctly. 

This  completes  the  testing  which  can  be  done  without  the  use  of  the  optical 
setup.  We  now  proceed  to  the  suggested  tests  which  can  be  performed  using 
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3.3  Testing  with  Optics 

Those  chips  which  pass  the  suggested  pre-testing  should  then  be  inserted  in  the 
optical  test  setup.  After  adjusting  the  beams  so  that  approximately  lO/xA  of 

photocurrent  is  generated  in  each  of  two  adjacent  photodetectors,  the  following  | 

measurements  should  be  made. 

3.3.1  Skew  Testing 

With  the  calibrate  line  inactive,  a  square  wave  should  be  applied  to  the  initial¬ 
ization  line  to  provide  for  periodic  discharging  of  the  skew  lines.  By  studying 
the  charging  characteristics  of  the  output  line  responsible  for  measurement  of 
skew  between  the  two  ORCAs  of  interest  and  comparing  these  characteristics 
to  those  generated  by  the  same  output  line  when  the  calibrate  line  is  active,  a 
skew  measurement  could  be  obtained.  This  same  measurement  should  then  be 
made  with  the  introduced  path  length  difference  affecting  the  second  ORCA's 
photodetector.  An  average  skew  should  then  be  computed. 

3.3.2  Frequency  Output 

Unfortunately,  to  drive  the  output  pins  reliably,  one  must  frequency  divide  the 
oscillator  before  outputting  it.  This  results  in  a  loss  of  frequency  resolution. 

However,  in  this  application,  we  merely  wish  to  verify  that  the  oscillator  has 
locked  to  the  optical  clock  frequency  and  not  some  harmonic  of  it.  To  do  this, 
we  set  the  select  lines  (SelectS-SelectO)  to  the  hex  value  of  the  oscillator  under 
consideration  and  observe  the  output  on  the  frequency  output  pins  (FreqOut3- 
FreqOutO). 


27 


3.3.3  Shift  Register  Test 


In  this  test,  the  “Error'’  lines  are  observed  and  an  estimate  of  how  many  tran¬ 
sitions  it  makes  in  a  ten  second  intervsJ^  is  taken.  Note  that  each  transistion 
(excluding  those  involved  in  the  start-up  transient)  indicates  that  an  error  oc¬ 
curred. 


*.Note;  the  stArt-up  Craiuient  should  not  included  in  the  ten-second  interval  of  test.  In  this 
way,  there  should  ideally  be  no  tranaistiona  during  testing. 
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Chapter  4 

Conclusions 

Here  we  present  a  summary  of  the  work  done  and  suggestions  for  future  work. 

4.1  Summary  of  Contribution 

This  work  demonstrates  that  a  reasonably  modular  circuit  can  be  designed  to 
implement  a  phase-locked  loop  approach  to  optical  clock  distribution.  Simula¬ 
tions  indicate  that  said  circuit  can  be  operated  at  a  clock  frequency  of  100  MHz, 
and  a  test  chip  featuring  this  circuit  along  with  appropriate  diagnostic  circuitry 
has  been  fabricated.  Finally,  suggested  test  setups  and  procedures  have  been 
detailed. 

4.2  Future  Work 

The  most  obvious  suggestion  for  future  work  is  to  perform  the  indicated  tests. 
Assuming  that  said  tests  confirm  the  functionality  of  the  circuit,  a  more  elabo¬ 
rate  circuit  should  be  designed — perhaps  implementing  a  non-trivial  high  speed 
function  such  as  a  discrete  Fourier  transform — using  this  optical  clock  distri¬ 
bution  circuit.  Such  a  device  would  require  a  far  more  sophisticated  optical 
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system  including  a  hologram  to  focus  the  optical  clock  onto  the  receptors,  a 
higher  power  laser  to  drive  more  than  two  receptors  at  a  time,  and  a  high  speed 
optical  modulator. 

On  the  other  hand,  should  the  fabricated  chips  fail  their  diagnostic  tests, 
it  would  be  necessary  to  determine  why.  Perhaps  current  SPICE  models  are 
simply  not  adequately  accurate  for  this  process.  Perhaps  skew  introduced  by 
detection  of  the  light  is  excessive  and  non-standard  processes  would  need  to  be 
utilized.  There  are  indeed  a  number  of  directions  to  explore  given  a  failure  of 
the  test  chips  to  perform. 
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Appendix  A 

Pad  Assignments 

A  summary  of  the  pad  assignment  on  the  0CD2  chip. 
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Appendix  B 

SPICE  parameters 


A  summary  of  SPICE3  parameters  used  in  simulating  the  analog  portions  of 
the  0CD2  chip. 


33 


SPICE  parameter 

•NMOS  value 

PMOS  value 

level 

2 

2 

vio  ( 

0.812464 

0.747301 

tox  ( m ) 

413E-10 

413E-10 

asub  (cm~^) 

8.644E+15 

1.774E+16 

xj  (m) 

0.4 

0.4 

Id  {(i) 

0.065 

0.071 

uo  (cm-/V-s) 

535.36 

^Esmi 

ucrit  (V/cm) 

39202.4 

10000 

uexp 

0.10088 

vmax  { m/s) 

5.1E-r4 

3.5067E-r4 

delta 

0.36 

l.OE-6 

rsh  (Q/sq) 

34.37 

129.5 

cgso  ( F/m) 

lE-10 

1.5E-10 

cgao  (F/m) 

lE-10 

1.5E-10 

\mmsamm 

2.535E-4 

cjsw  (F/m) 

5.87E-10 

3.33E-10 

mj 

0.6744 

0.5213 

mjsw 

0.2801 

pb  (V) 

0.71 

0.71 

kp  ( A/V*) 

4.476e-5 

2.008E-5 

lambda  ( V“  ‘ ) 


gamma  f 


phi  ( V) 


nls  (cm~-) 


js  (A/m*' 


netf 


8 


0.6 


5.05E12 


lE-14 


l.OOlE-2 


0 


1 


0.5298 


0.6 


3.19697E12 


lE-U 


i.OOlE-2 


0 


-i 
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Appendix  C 

Process  variations 


Selected  measured  process  variations  for  the  2.Qiitn  n-well  CMOS  MOSIS  run 
M71K  are  presented.  These  are  the  worst  case  variations  and  were  used  in  the 
simulations  of  the  amplifier  chain  on  page  Id- 


process  parameter 

n-  ~ 

worst  case  low 

worst  case  high 

NMOS  threshold  voltage  (V) 

0.6424 

1.265 

PMOS  threshold  voltage  (V) 

-0.7038 

-1.354 

NMOS  delta  length  (pm) 

-.352 

+.352 

PMOS  delta  length  (pm) 

-.216 

+.216 

NMOS  delta  width  (pm) 

-.577 

+.577  1 

PMOS  delta  width  (pm) 

-.732 

+.732 

Note  that  the  delta  length  and  delta  width  are  the  variations  with  respect 
to  some  nominal  length  or  width.  For  example,  a  2^m  NMOS  drawn  channel 
length  may  actually  vary  from  l.648fim  to  2.352/im. 
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Appendix  D 

Specifications 


Copies  of  the  specification  sheets  for  the  following  test  equipment  used  are 
provided  on  the  following  pages. 

1.  Laser  Diode  specifications  taken  from  Oriel  Lasers  &  Accessories  Catalog, 
®  1988  by  Oriel  Corporation,  Stratford,  CT. 

2.  Beamsplitter  specifications  taken  from  Melles  Gnot  Optics  Guide  4<  ©  1938 
by  .Melles  Griot.  Used  with  permission. 
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DICDE  LASER  SYSTEMS 


a  L:mpact.  Economical.  Ccmc;e'-9  ^aciocas 

■  Tnoice  of  Waveienc:"  =''c  -c  .ver 
»  Modulation  Inout 

■  Powers  to  25  mW 

■  Vanous  Beam  Sizes 

■  Meet  CORH  Safety  ^eouirem.ents 


Oriel’s  new  compact  Diode  Laser  Systems  are  ideal 
for  the  development  of  applications  of  these  versatile 
solid  state  lasers.  Our  units  are  complete,  ready  to 
plug  in  and  turn  on.  The  tiny  laser  diodes,  wJ.ich  are 
vulnerable  to  static,  are  already  safely  wired.  Pre- 
aligned  optics  collect  and  collimate  the  lignt 
efficiently. 

OniEL  DfODE  LASER  SYSTEMS 

Oriel  Diode  Laser  Systems  use  oroven  GaAIAs  lasers  witn 
output  powers  from  4  to  25  mW  and  waveienqtns  from  750  to 
B30  nm.  The  lasers  are  ouilt  on  tiny  cnips  of  p-type  GaAs 
substrates.  (Fig.  1) 

The  active  layer  IS  only  a  fraction  of  a  micron  thick,  wner^ 
current  flows  througn  this  layer,  lignt  Is  emitted  as  electrons 
and  holes  recombine  The  refractive  moex  of  the  active  laver 
is  nigner  than  that  of  me  'avers  acove  ana  beiow. 

Any  light  stnking  the  interface  ai  a  large  angle  ot  incidence 
is  internally  reflected.  As  in  a  fiber  ootic  the  'ignt  isquicec  n 
the  high  refractive  mcex  region.  A  V  groove  mrougn  i 
•  urrer*  blocking  layer  confines  the  current  flow  lateraiiv, 
:efining  the  gain  region.  Lignt  wnicn  escapes  is  apsorcec  .n 
the  unpumped  crystal.  This  "'s  called  gain  guiding’  ~''e 
dimensions  of  the  lasmg  region  ensure  staple  operation  cf 
the  fundamental  transverse  mode,  typ  other  mode"^  z3n 
operate.The  higher  power  Chios  nave  optical  coatings  ^  ne 
front  and  rear  surfaces  to  optimize  iignt  extraction. 


Each  laser  cnip 's  mounted  m  a  can  as  shown  in  Fig.  2.  A 
P'M  onotodiode  locateo  oenma  the  laser  chib  continuously 
■nonitprs  the  output  ‘i-om  the  rear  of  the  laser.  We  use  the 
pnoTOGiode  signal  n  a  'eeacack  ;oop  to  staoilize  the  laser 
output.  A  large  aluminum  heatsink  keeps  the  diode  can  cool. 
Our  anve  circuit  powers  the  laser  and  maintains  steady 
output.  The  circuit  mciuaes  a  CC  ioop  for  control  of  total 
power  and  an  AC  tooo  to  minimize  noise.  Bypass  capacitors 
ana  a  slow  start  circuit  protect  the  laser  chip  from  fatal 
surges. 


DIODE  LASER  SYSTE:. 


WINDOW 


PHOTO  DIODE 


LASER  CHIP 


^ - HEAT  SINK 


FI9.2  The  icMr  and  monitor  photodiode. 

LASER  CONTROLS 

Operating  any  Oriel  Diode  Laser  System  is  straightfor¬ 
ward.  Simply  plug  the  DC  power  supply  into  the  rear  of  the 
laser  head  and  into  the  line,  wire  the  safety  interlock 
appropnately,  and  turn  the  key  switch.  A  green  warning  light 
is  activated  and  a  lew  seconds  later  the  laser  comes  on.  A 
control  knob  on  top  of  the  laser  changes  the  output  power 
from  about  20  to  more  than  100  %  of  rated  power.  The  safety 
shuttc.  may  be  used  to  stop  the  beam. 

A  BNC  jack  at  the  rear  of  the  laser  accepts  modulation 
input. 

MODULATION  INPUT 

You  can  modulate  Oriel  Diode  Laser  Systems  at  frequen¬ 
cies  from  100  kHz  to  500  MHz  by  using  the  BNC  connector 
on  the  rear  of  the  laser.  The  input  is  AC  coupled.  Lower 
frequency  modulation  is  not  possible  except  to  special 
order.  Typically  3-5  volts  into  the  50  ohms  input  results  in  50 
-50  %  modulation.  Up  to  90%  modulation  is  possible  with  17 
oSm  RF  power.  The  power  control  circuit  automatically 
adjusts  DC  bias  power  to  Keep  the  average  output  power 
constant. 

LIFETIME 

Diode  lasers  usually  fail  for  one  of  two  reasons. 

•  Thermal  stress  on  the  semiconductor. 

*  Damage  to  the  laser  output  facet. 

Thermal  stress  from  running  the  diode  (or  from  soldering 
It  in  place)  can  result  in  the  growth  of  tiny  flaws  in  the  crystal 
structure.  When  crystal  defects  become  large,  the  diode 
cannot  lase. 

As  delect  development  and  propagation  is  temperature 
dependant,  the  lifetime  is  dramatically  shortened  by  running 
Jf  higher  than  design  temperatures.  MTBF  drops  from 
200,000  hours  for  a  case  temperature  of  30  degrees  to  20,000 
•'ours  (Z5  years)  for  a  case  temperature  of  65  degrees.  The 
•iigher  power  lasers  have  shorter  lifetimes. 

The  laser  power  density  at  the  output  facet  of  the  chip  is 
''ery  hign.  This  leads  to  gradual  darkening  of  the  facet  from 
Photochemical  changes.  The  MTBF  figures  indicate  thatthis 
**  hot  a  senous  problem.  However,  static,  power  surges,  or 
high  turn  on  currents  can  resuit  in  momentary  high  power 
Pperation  of  the  laser.  At  25  mW  on  a  1  by  5  micron  typical 
Potput  facet,  the  continuous  power  density  in  normal 
Pperation  exceeds  several  MW/cm^,  Much  higher  power 
P*hsity  from  transients  causes  catastrophic  damage  to  the 

I^ip  surfaces  wnicn  act  as  the  laser  reflectors,  destroying 
^  laser. 


SAFETY 

Appreonata  o.,.‘4r7  -leasures  snould  be  used  when  work¬ 
ing  with  any  laser  "i-ie  nign  output  power  from  the  tiny  diode 
laser  package,  arc  the  fact  that  most  are  invisible,  demanos 
special  precautions  for  the  operator  and  control  of  access  to 
the  working  area.  Some  of  our  diode  laser  beams  are  bareiy 
visible.  This  can  be  misleading.  A  25  mW  laser  producing  a 
faint  red  spot  is  as  oangerous  as  a  bright  25  mW  argon  ion 
laser,  and  potentially  more  so  because  of  the  natural 
association  of  danger  with  the  bright  beam  and  the  protec¬ 
tion  from  the  'blinx  resoonse*. 

Our  lasers  induce  a  shutter,  key  switch,  labels  and 
intehock  as  required.  Safety  eyewear  is  listed  on  pages  30  to 
31  and  door  signs  on  page  32. 
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_  -  DIODE  LASER  SYSTEMS 


I 


I 
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BEAM  J:--.;RACTERISTICS 


Dicce  lie '3  ao  not  nave  the  familiar  circular  beam  shape 
with  Gaucc  in  intensity  distribution  of  a  TEMoo  Helium 
Neon  Laser.  The  beam  is  elliptical,  does  not  have  a  true 
Gaussian  profile,  but  does  have  the  high  brightness  and  low 
divergence  necessary  for  focussing  to  small  areas  or  coup¬ 
ling  to  optical  fibers. 

Our  lasers  are  all  fundamental  trattsverse  mode  TEoo.  The 
assymetry  in  the  vertical  and  horizontal  lasing  region 
appears  m  the  collimated  elliptical  beam  which  has  low  but 
different  divergences  parallel  and  transverse  to  the  lasing 
stripe. 

The  beam  is  also  astigmatic  liKe  most  diode  laser  sources. 
The  minor  and  maior  axes  of  the  ellipses  appear  to  have 
originated  in  different  spots  a  few  microns  apart. 
Refocussed  beams  have  the  minima  of  the  major  and  minor 
axes  in  different  positions.  This  is  unimportant  except  for 
chtical  tight  focussing  applications. 

Fig.  6  shows  actual  beam  shapes  of  our  79426  Laser 
System.  This  is  typical  for  our  devices  with  larger  beams. 
The  beam  charactenstics  of  the  smaller  beam  systems  are 
more  complex.  The  source  wavefronts  and  abberations  of 
the  very  high  F/#  lens  used  to  collimate  the  laser  light  from 
this  source  result  in  unusual  beam  patterns  up  to  500  mm 
from  the  source.  (See  page  49  for  more  beam  profiles). 
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Fig.  4  Output  ol  tho  Modal  79426  ovor  a  4  hour  partod. 


Rg.  5  Typical  short  tarm  powar  variation  of  ttia  modal  79426. 


OUTPUT  STABILITY 

Tho  :3spr  output  IS  Constantly  monitored  by  the  pho- 
todioce  .v.nic.n  ;s  not  sensitive  to  temperature.  The  output  is 
temperature  compensated.  The  slow  DC  loop  stabilizes  the 
power  to  better  than  2%  (Fig.  4).  The  A.C.  loop  maximizes 
sign  :;i,s  ratio.  (Fig  5). 

Th  ■'  -w  lasers  can  exhibit  occasional  power  fluctua¬ 
tion;  .  3w  percent  due  to  longitudinal  mode  transitions. 
All .  irs  are  sensitive  to  reflection  of  the  beam  bacK  into 
the  die  .  s.  This  can  cause  completely  unstable  operation. 


Fig.  6  Horizontal  (laft)  and  vartical  (right)  Intensity  profllas 
of  the  79426  Laser  System  at  4m  from  the  laser  output 


SPECTRUM  I 

We  offer  diode  laser  systems  with  a  choice  of  wavelength 
from  750  to  830  nm.  The  actual  wavelength  will  be  within  ♦/-  | 

5  nm  of  the  nominal.  When  the  laser  is  warmed  up  and  I 
running  at  full  power,  the  output  spectrum  is  the  sharp  spike  j 
characteristic  of  single  longitudinal  mode  operation.  The  ; 
width  of  this  spike  was  measured  as  0.6  G!-lz  (1.2  pm)  for  our 
780  nm  5  mW  laser.  The  linewidth  was  measured  using  a  7.5 
Ghz  FSR  Scanning  Interferometer.  (Fig.  7)  The  narrow  line  j 
has  a  sensitivity  to  diode  can  temperature  of  about  0.25nm/  , 

degree  C.  Significant  variations  in  ambient  will  influence  the  | 

wavelength  but  not  in  a  smooth  monotonic  fashion.  Mode  j 
hops  of  a  fraction  of  a  nanometer  occur  as  the  wavelength  i 
increases  with  temperature.  | 

The  wavelength  also  changes  with  the  power  control.  (Fig.  ; 

8,  page  49)  At  low  power,  some  lasers  operate  in  many  | 
longitudinal  modes  and  the  spectrum  appears  as  the  familiar 
comb  of  modes.  To  ensure  single  mode  output,  operate  > 
these  devices  at  maximum  power.  | 

POLARIZATION 

The  laser  chips  are  polanzed  parallel  to  the  minor  axis  of  ! 
the  ellipse.  The  polarization  of  the  beams  from  our  systems 
is  nominally  horizontal  with  the  laser  system  supported 
normally.  When  the  power  is  measured,  while  rotating  one  of 
our  crystal  polarizers  in  the  beam,  the  ratio  of  maximum  to  i 
minimum  is  typically  75;1. 


Fig.  7  Specfnim  of  Itio  Modoi  79403  laser  rtcorded  with  a  scan¬ 
ning  Fabry  Parol  Inlarfaromatar.  Tha  traa  spectral  range  of  the 
hUartorometer  was  7.5  GHz  and  tha  laser  was  running  at  full 


:iODE  LASER  SYSTEMS 


lOUNTlNG 

"he  laser  can  be  easily  roa  mounted  for  use  on  ootical 
cies  or  oencnes.  There  are  tnree  taboea  1/4-20  noies  in  the 
ier  base.  These  holes  are  m  a  line  oarrallel  ana  under  the 
;t:cal  axis.  The  holes  are  soacea  1.5  inches  (38  mm)  aoart. 

extensive  range  of  rods,  rod  holders  and  accessones 
-I  oe  used  to  nold  the  laser  in  any  onentation.  See  Volume  I 
-  these  products. 

3UPL1NG  THE  OUTPUT  TO  OPTICAL  FIBERS 

"he  output  of  the  small  beam  lasers  may  be  directly 
joied  to  an  optical  fiber  using  the  coupling  systems 
scnbed  on  page  76.  These  fit  directly  on  the  threaded 
tout  of  our  laser  systems.  These  compact  couplers  can 
0  oe  mounted  in  the  77842  holder  for  use  after  the  beam 
5  exited  the  laser  system,  in  either  case  be  sure  that  the 
jbier  Chosen  matches  the  beam  and  fiber  sizes.  For 
?rnative  coupling  arrangements  see  page  76. 


3  0DUCT  LISTING  AND 
^DERING  INFORMATION. 


odd 

10. 

Wiydongtli 

nn 

Power 

mW 

Beam  Slza 
mX  mm 

Bsam  Qlv* 

.  mrad 

Prlea 

'(*1 

1401 

750 

4 

1JX0.7 

<1 

imsm 

1402 

750 

.  < 

4iXZ0 

^8500 

'403 

750  ,  . 

4 

&9X4.0  . 

■  <0.4 

.  STSSilO^ 

1408 

780 

•4 

y  iixa7 

<1  • 

OTSlOO 

•407 

780 

.  4 

4.5X10 

•  <(15 

»7SiU 

'408 

.  780 

4 

a.9X4.0  . 

/  <  0.4 

»75in 

'411 

780 

8 

ijxa7 

<1  - 

•412 

..780 

8 

4SX10 

.  <05 

08550 

413 

780 

8 

8JX4.0 

<14  • 

Qtatw 

'418 

780 

16  • 

iixa7 

<1 

8132550 

417 

780 

16 

4.5  X  2.0 

<05 

81325.00 

418 

780 

16 

8.9  X  4.0 

<0.4 

8132550 

421 

810 

4 

IS  X  0.7 

.<1 

878550 

422 

810 

4 

4iX2.0 

<05 

878550 

423 

810 

4 

8.9  X  4.0 

<0.4 

878550 

428 

830 

25 

1JX0.7 

<1 

8162500 

427 

830 

25 

4iX10 

<05 

81625.00 

428 

830 

25 

8.9  X  4.0 

<0.4 

8162550 

?  50%  intensity  points  are  msasuraO  at  1  m  and  S  nt  from  the 
iiant  output 


-:CIF!CATIONS  COMMON  TO  ALL  ORIEL 

:de  laser  systems 


jt  mode; 

•  soacing: 
zaiion  Ratio; 
"g  Staoilify; 
•90  Amoient; 


Single  Transverse 

Single  Longitudinal  at  full  oower 

Aoproximateiy  0.32nm 

Better  than  50;1 

Better  than  O.i  mrad 

-10  to  50* C 


Dimensions  of  the  C:  •n  Diode  Laser  Systems 


Olwienaiona  In  Inches  (nun). 


CORH  warning  logotypea,  sinttlar  to  that 
mown  above,  appear  on  each  laaar  to  IndK 
cala  the  CORH  classtficatton  and  to  certify 
tttat  the  ou  :  w.  jower  ol  the  laaer  will  not 
j  exceed  the  -jowef  level  printed  on  the 

I  logotype. 


i  •  -AX 


iO'c.  and  rejection  and  aosorotion  are  boch  approximately 
■ .  >/tih  the  s-  and  p-poianzed  components  wnhin  10%  of 
otner.  Over  a  wide  range  of  wavelengths,  the  s-  and  p- 
^iiunzed  components  stay  within  10%  of  each  other.  The  broad- 
'jnu  spectral  flatness  of  these  beamspliners  makes  them  ideal  for 
-jveiength  scanning  instruments. 

sinotner  advantage  of  these  hyorid  coatings  is  that  they  are 
,iriv  insensitive  to  changes  in  the  angle  of  incidence.  .Although 
itioe  .oeamspiitter  configuration  usually  causes  the  angle  of 
■ciccnce  to  be  45“.  this  angular  invariance  in  the  pen'ormance 
•jj  .-le  advantage  of  making  these  beamsplitters  cnromaticaily 
•;'jtrai  for  convergent  or  divergent  beams. 

Meiies  Griot  hyond  coatings  cover  the  visible  spectrum  (400* 

■  *'nmi  or  the  very  near-inirared  for  laser  diode  use  (700- 
.<inmi  Longer  or  shoner  waveiengui  versions  of  this  coating 

■  ■i  '.ot  possible  due  to  severe  matenai  limitations. 


broadband  wavelength  range,  averaged  over  plane  s-polarizaiion. 
However,  they  are  extremely  polarization  sensitive,  with  the  n- 
and  p-components  diffenng  by  as  much  a.s  70%  Great  care 
should  therefore  be  taken  when  using  these  broadband  hcamspUt- 
ters  to  consider  polarization  implications  for  the  optical  system 
into  which  they  are  to  be  integrated. 

Meiies  Griot  offers  broadband  partial  rejection  coatings  for 
the  visible  spectrum  (450-650nm).  the  laser  diode  short- 
waveband  (650-900nm).  the  neodymium  laser  region  (900- 
nOOnmi.  .ind  the  laser  diode  telecommunications  waveband 
(I300-I600nm). 

ALL-DIELECTRIC  IMON-POLARIZING  COATINGS 

This  range  of  cube  beamsplitter  coatings  is  intended  for  those 
applications  where  polanzation  effects  must  be  kept  to  an  abso¬ 
lute  minimum.  Unlike  the  hybrid  coatings,  these  dieleanc  coat¬ 
ings  are  designed  for  high  pen'ormance  at  specific  wavelengths, 
wnere  they  easily  exceed  the  pcnormance  of  any  other  available 
beamsplitters.  Being  totally  dielectric,  they  have  negligible 
aosorption. 

At  the  design  wavelength,  each  of  these  beamsplitters  reflect 
50  r  5%  of  incident  light.  The  s-  and  p-components  of  the 
reflected  (and  therefore  transmitted)  beam  differ  by  less  than 
5%.  (i.e.  each  is  within  of  the  average  polarization  per¬ 

formance). 

The  current  range  of  these  beamspliners  covers  two  important 
laser  waveieneths.  but  other  wavelengths  can  be  accommodated 
on  special  request.  The  laser  wavelengths  are  the  red  helium 
neon  line  at  633nm.  and  the  780nm  laser  diode  wavelength. 


ANTIREFLECTION  COATINGS 

.All  our  cube  beamspliners  are  antireflection  coated  on 
ail  four  faces  to  minimize  ghost  images  and  relleciion 
losses.  They  are  normally  supplied  with  a  multilayer 
HEBBAR'™  antireflection  coating.  TTiere  is  no  need  to 
append  a  Coating  Suffix. 


CUBE  BEAMSPLITTERS 


Cube  beamspiiners  have  several  advantages  over  plate  beam- 
spliners  and  are  widely  used  tor  :ne  rollowing  reasons.  These  are 
rugged  beamspiiners  which  are  exsy  to  mount  and  are  ideal  for 
beam  superposition  applications.  This  type  of  beamsplitter 
deforms  much  less  when  suoieaed  to  mechanicai  stress  than  does 
a  plate  beamspliner.  Most  of  the  unwanted  reflections  from  a 
cube  beamspliner  are  in  the  retro  direction  and  do  not  contribute 
to  ghost  images.  The  coating  is  very  resistant  to  degradation  with 
time  since  it  is  sealed  within  the  body  of  the  cuoe. 

If  cuoe  beamspiiners  are  used  in  convergent  or  divergent  por¬ 
tions  of  an  optical  beam,  they  will  contribute  substantial  amounts 
of  unwanted  aberration.  This  can  be  avoided  or  .minimized  by 
only  using  these  components  with  collimated,  or  nearly  colli¬ 
mated.  beams.  Conjugate  distances  which  Include  cuoes  there¬ 
fore  snould  be  long.  .•Mtemateiy.  other  elements  of  the  system 
can  be  designed  to  compensate  for  any  aoerrations  introduced  by 
the  cuoe  in  a  non-collimated  beam. 

Cube  beamsplitters  consist  of  matched  pairs  of  identical  nght- 
angle  pnsms.  with  their  hypotenuse  faces  cemented  together. 
Prior  to  cementing,  a  partial  retlection  film  is  deposited  onto  one 
of  the  hypotenuse  faces.  The  pnsm  which  is  coated  is  marked 
with  a  small  dot  on  one  of  the  ground  faces.  For  best  results,  the 


incident  beam  should  be  on  one  of  the  faces  of  this  pnsm 

We  currently  offer  three  types  of  cube  beamsplitter,  ciassi 
by  the  nanire  and  penormance  of  the  panial  reflection  coat 
used.  The  metal-dieieCTnc  hybnd  coatings  reflect  and  tranv 
equal  portions  of  incident  light  with  10%  aosorption  and  . 
fairiy  small  differences  in  s-  and  p-polanzation  over  an  exte.% 
waveiength  range.  The  broadband,  ail-dielemnc  coatings  aov. 
negiigible  amounts  of  the  incident  intensity,  but  the  s-  anc 
poianzed  penormance  cnaractenstics  are  quite  different.  In  i. 
non.  .Melles  Gnot  now  offers  a  range  of  non-polariz:  j 
all-dielectnc.  partial  retlection  coatings  for  these  cube  beamsr 
ters  wnich  have  almost  identical  s-  and  p-reflectances  at  spec 
laser  wavelengths. 

Each  of  the  coating  types  has  its  panicular  ments  and 
tations.  so  your  selection  snould  depend  on  the  inten^ 
application. 

HYBRID  PARTIAL  REFLECTION  COATINGS 

Hybnd.  metal-dieiectnc  coatings  combine  the  benefits  of  b 
metals  and  dielectncs  to  produce  a  moderate  absorption  bet 
splitter  with  little  polarization  sensitivity.  Typically,  absorptxx 


iO‘'c.  jna  retlection  and  absorption  are  both  approximately 
.  vviin  me  s-  and  p-poiarized  components  within  \0%  of 
;acn  otner.  Over  a  wide  ranee  of  wavelengths,  the  s-  and  p- 
'.iianzeo  componem.s  stay  wnhin  lO't  of  each  other.  The  broad- 
'jnd  spectral  flatness  of  these  beamsplitters  maices  them  ideal  for 
.jveieneth  scanning  instruments. 

\noiner  advantage  of  these  hyond  coatings  is  that  they  are 
jr;v  insensitive  to  changes  :n  the  angle  of  incidence.  .Although 
'i  cuoe  oeamspliner  configuration  usually  causes  the  angle  of 
rcicence  lo  oe  a5°.  this  angular  invariance  in  me  pen'ormance 
Ms  .ne  aovaniage  of  making  these  oeamspiitters  chromatically 
■.’•jtrai  tor  convergent  or  divergent  beams. 

Meiles  Griot  hyond  coatings  cover  me  visible  spectrum  (400- 
■  Onmi  or  the  very  near-infrared  for  laser  diode  use  (700- 
COnmi.  Longer  or  shoner  wavelength  versions  of  this  coating 
not  possible  due  to  severe  matenai  limiutions. 


’  T 


partial  RErLECTlON  COATINGS 

,  '  'wiana  oartiai  rejection  coatings  orovide  a  hign  oegree  of 
There  s  negligible  aosorotion  m  the  coating,  and  the 
I  ■  jno  transmiiteo  comtsonents  are  almost  eouat  over  a 


broadband  wavelength  range,  averaged  over  plane  s-polanzation. 
However,  they  are  extremely  polarization  sensitive,  with  the  s- 
and  p-components  differing  by  as  much  a.s  70  Great  care 
should  therefore  be  taken  when  using  these  broadband  beam.splti- 
ters  to  consider  polarization  implications  for  the  optical  system 
into  which  they  are  to  be  integrated. 

.Meiles  Griot  offers  broadband  paniai  retleaion  coatings  for 
the  visible  spectrum  (450-650nm).  the  laser  diode  short- 
waveband  (650-900nm).  the  neodymium  laser  region  (900- 
13(X)nm).  and  the  laser  diode  telecommunications  waveband 
(13CX)-I600nmi. 

AU-DIELECTRIC  NON-POLARIZING  COATINGS 

This  range  of  cube  beamspliner  coatings  is  intended  for  those 
applications  where  polanzaiion  effects  must  be  kept  to  an  abso¬ 
lute  minimum.  Unlike  the  hybrid  coatings,  these  dielectric  coat¬ 
ings  are  designed  for  high  pen'ormance  at  specific  wavelengths, 
where  they  easily  exceed  the  penormance  of  any  other  available 
beamsplitters.  Being  totally  dielectric,  they  have  negligible 
absorption. 

At  the  design  wavelength,  each  of  these  beamsplitters  reflect 
50  -  5%  of  incident  light.  The  s-  and  p-components  of  the 
refleaed  (and  therefore  transminedl  beam  differ  by  less  than 
5%.  (i.e.  each  is  wuhin  r--5fo  of  the  average  polarization  per¬ 
formance!. 

The  current  range  of  these  beamsplitters  covers  two  important 
laser  wavelengths,  but  other  wavelengths  can  be  accommodated 
on  special  request.  The  laser  wavelengths  are  the  red  helium 
neon  line  at  633nm.  and  the  780nm  laser  diode  wavelength. 


ANTIREFLECTION  COATINGS 

.All  our  cube  beamspliners  are  antireflection  coated  on 
ail  four  faces  to  nunumze  ghost  images  and  reflection 
losses.  They  are  normally  supplied  with  a  multilayer 
HEBBAR"^  antiretlection  coating.  There  is  no  neec  to 
append  a  Coating  Suffix. 


SPECIFICATJONS;  CUBE  BEAMSPUTT6RS 

Dimensions:  ±0.3tnm 

Maienai:  BK  7  grade  A  fine  annealed 

Face  Ratness:  2X  per  clear  aperture  ac  546nm 

Transmission 

Hybrid:  45  ±  6%  (±5%  variation  with  waveleng'h) 
Broadband  Dielectric:  50  ±  5%  (±3%  variauon  with  wave¬ 
length),  for  average  polarization 

Non-Polarizing  Dielectric:  50  ^  5%  for  any  polanzation  with 
the  s-  and  p-components  matched  to  within  3% 

Absorption 
Hybnd:  10% 

Broadband  Dielectric:  <0.5% 

Non-Polanzing  Dielednc:  <0.5% 

Coatings:  All  4  faces  HEBBAR™  antireflection  coated 
Cosmetic  Surface  Quality;  60-40  scratch  and  dig 


Melles  Choi 's  phsm  tobies  and  post-mounted  pnsm 
holders  are  ideal  for  holding  these  cube  beamsplirrr"  •- 
an  adjustable  yet  stable  manner.  See  the  Component  - 
ers  seaton  of  this  guide  for  a  complete  desenpu: 
listing  of  these  useful  mounts. 
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Cube  Beamsplitters:  Visible  NVaveleneth  Ranee 
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.•aimplilters:  Near-fR  Wivelencth  Ranse 
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